Method for producing RNA viruses from CDNA

ABSTRACT

The present invention relates to methods for producing RNA virus cDNA, methods for producing viable, RNA virus and viable, RNA virus produced by those methods. The invention also related to a novel RNA virus cDNA, recombinant DNA molecules containing that cDNA and hosts transformed with those recombinant cDNA molecules. This invention further related to novel methods for screening for variants of a strain 3 poliovirus. This invention also related to methods for increasing the attenuation of a strain poliovirus. This invention provides a vaccine useful for immunizing a subject, for example a human, against infectious poliovirus, wherein the vaccine comprises an effective amount of an RNA virus, produced by transforming a suitable host cell with a recombinant nucleic acid sequence which encodes for the virus: culturing the host cell under conditions which permit the production of virus: and isolating the virus so produced, effective to immunize the subject, and a suitable carrier. Further provided to this invention is a method of immunizing a subject such as a human against infectious poliovirus, wherein the method comprises administering the subject a suitable dose of the vaccine described hereinabove.

This application is a continuation of U.S. Ser. No. 07/852,260, filed Jun. 19, 1992, now U.S. Pat. No. 5,525,715, which is a U.S. national stage application of PCT International Application No. PCT/US91/05890, filed Aug. 20, 1991, which is a continuation-in-part of U.S. Ser. No. 07/569,916, filed Aug. 20, 1990, now abandoned, and a continuation-in-part of U.S. Ser. No. 07/570,000, filed Aug. 20, 1990, now abandoned.

BACKGROUND OF THE INVENTION

Throughout this application various references are referred to within parentheses or with arabic numerals within parenthesis. Full bibliographic citations for these publications referred to by arabic numerals may be found at the end of the specification immediately preceding the claims. The disclosures of these publications in their entireties are hereby incorporated by reference into this application to more fully describe the state of the art to which this invention pertains.

Human enteroviruses belonging to the family Picornaviridae are characterized by a single-stranded positive RNA genome. Members of this viral family include poliovirus, echoviruses, coxsackieviruses and rhinoviruses. Among these viruses, poliovirus has been the most extensively studied.

Poliovirus is known to be the causative agent of poliomyelitis, a paralytic disease of the central nervous system. This virus is known to exist in three stable serotypes--1, 2 and 3. For over 25 years, this disease has been controlled by the use of both the Sabin oral live-attenuated vaccine and the Salk inactivated virus vaccine. The Sabin vaccine consists of attenuated virus of each serotype, none of which are capable of causing disease. The strains used to produce the vaccine were created by a combination of extensive in vivo and in vitro passage of each of the three wild-type strains through monkey tissue. Upon oral administration, the live virus contained in the Sabin vaccine replicates in the gut, thereby inducing both systemic and local immunity. The killed virus (Salk) vaccine, which is administered intramuscularly, is limited to inducing systemic immunity.

Although the Sabin vaccine is considered to be a safe and effective protection against poliomyelitis, a small number of recipients have developed vaccine-associated the disease.

In an effort to understand the molecular basis of attenuation and reversion, the nucleotide sequences of cDNAs corresponding to each of the 3 attenuated strains and their wild-type progenitors, were compared A. Nomoto et al., "Complete Nucleotide Sequence of the Attenuated Sabin 1 Strain Genome", Proc. Natl. Acad. Sci. USA, 79, pp. 5793-97 (1982); G. Stanway et al., "Nucleic Acid Sequence of the Region of the Genome Encoding Capsid Protein VP1 of Neurovirulent and Attenuated Type 3 Polioviruses", Eur. J. Biochem., 135, pp. 529-33 (1983); G. Stanway et al., "Comparison of the Complete Nucleotide Sequences of the Genomes of the Neurovirulent Poliovirus P3/Leon/37 and its Attenuated Sabin Vaccine Derivative P3/Leon 12ab", Proc. Natl. Acad, Sci. USA, 79, pp. 1539-43 (1984); and H. Toyoda et al., "Complete Nucleotide Sequences of All Three Poliovirus Serotype Genomes", J. Mol. Biol., 174 pp. 561-585 (1984)!. The observed differences in nucleotide sequence between each wild-type progenitor and its resultant attenuated strain were then further analyzed to determine their relationship to the phenomenon of attenuation.

In serotype 3, for example, the attenuated strain differed from the wild-type strain by only 10 point mutations G. Stanway et al., (1984), supra!. Of these differences, only the changes at nucleotide positions 472 and 2034 were thought to be strongly associated with attenuation D. M. A. Evans et al., "Increased Neurovirulence Associated With A Single Nucleotide Change In A Noncoding Region of the Sabin Type 3 Poliovirus Genome", Nature, 314, pp. 548-50 (1985); G. D. Westrop et al., "Genetic Basis of Attenuation of the Sabin Type 3 Oral Poliovirus Vaccine", J. Virol., 63, pp. 1338-44 (1989)!.

Prior to the identification of the nucleotides which are linked to attenuation, it was demonstrated that cDNA synthesized from a viral RNA template ("RNA virus cDNA") could be utilized to produce viable poliovirus following transfection of mammalian cells V. R. Racaniello et al., "Cloned Poliovirus Complementary DNA Is Infectious In Mammalian Cells", Science, 214, pp. 916-19 (1981)!. Such observations created the possibility of producing improved polio vaccines via genetic engineering techniques. This could be achieved by altering the cDNA around the crucial nucleotides so as to minimize reversion to the wild-type nucleotide, while maintaining structural and functional integrity of the virus.

Despite the discovery that RNA virus cDNA can be used to produce viable virus, it has never been demonstrated that these cDNA are accurate copies of the viral RNA present in wild-type or vaccine virus. Moreover, the use of cDNA sequences to determine which nucleotides are linked to attenuation, may have caused one or more critical sites to have been overlooked. This is because the process used to produce cDNA, namely reverse transcription, is known to be errorprone I. M. Verma, "Reverse Transcriptase", In The Enzymes, Vol. 14, P. D, Boyer, ed., Academic Press, New York, pp. 87-104 (1981)!.

Accordingly, a need still exists for the production of RNA virus cDNAs which are truly complementary to the vaccine virus RNA. Moreover, the use of inaccurate RNA virus cDNAs may result in reduced attenuation, if these cDNAs are ultimately to be used to produce vaccines, such as polio vaccines.

The genome of poliovirus is a single-stranded RNA molecule of plus-sense that is approximately 7500 nucleotides in length. The error frequency associated with replication of single-stranded RNA, as for poliovirus, is especially high compared to that of double-stranded DNA (3). Due to this inherent property, every preparation of poliovirus including the original Sabin (SO) strains must be considered genotypically heterogeneous.

Culture conditions (ie. temperature, cell substrate) as well as the homogeneity of the input virus are likely to influence which genotype predominates during amplication of a poliovirus sample. It is therefore not surprising that authorities who regulate the manufacture of OPVs (ie. FDA and WHO) dictate strict guidelines regarding the production of manufacturing seeds as well as the passage level of the seed represented in vaccine (22, 26). These regulations were put into action as an effort to minimize selection and amplification of less attenuated variant strains.

It has been well documented that the attenuated phenotype of the Sabin 3 strain is less genetically stable than the type 1 and 2 vaccine strains (4, 7, 11). In the past, a new manufacturing seed (RSO) was derived from the original Sabin 3 virus by selecting a plaque produced in Vervet monkey kidney cell monolayers from extracted infectious RNA (19). The isolate was chosen based on increased stability of its sensitivity to grow at 40.3° C. (rct marker) during serial passage as well as increased attenuation in monkeys. The sensitivity of growth at temperatures above 37° (rct marker) is still employed as an in vitro biological test to analyze the quality of vaccine strains (13).

A report by Kohara et al. (9) suggested that an infectious cDNA clone might be used to preserve the constancy and quality of the Sabin 1 seed. It is plausible that a similar approach could also benefit the attenuated type 3 strain. Until recently, the literature contained two cDNA sequences for Sabin 3 which differed at nucleotide positions (17, 21). The divergence between these sequences may be due to the fact that passage derivatives and clonal isolates of Sabin 3 rather than actual vaccine virus were used for making the cDNA clones.

SUMMARY OF THE INVENTION

This invention also provides a vaccine useful for immunizing a subject, for example a human, against infectious poliovirus, wherein the vaccine comprises an effective amount of an RNA virus, produced by transforming a suitable host cell with a recombinant nucleic acid sequence which encodes for the virus; and isolating the virus so produced, effective to immunize the subject.

Further provided by this invention is a method of immunizing a subject such as a human against infectious poliovirus, wherein the method comprises administering to the subject a suitable dose of the vaccine described hereinabove.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 depicts a chromatographic profile of first-strand cDNA synthesized from poliovirus strain 3 RNA off a Sepharose CL-4B column.

FIG. 2 depicts the cDNA clones which mapped to the P3/Sabin genome.

FIG. 3 depicts the strategy for assembling partial cDNAs into a single, full-length P3/Sabin cDNA according to this invention.

FIGS. 4A and 4B depict the strategy for mutagenizing P3/Sabin cDNA and constructing a true, full-length P3/Sabin cDNA according to this invention. Depiction of the strategy continues in FIG. 5.

FIG. 4C depicts the following. The sequence of pLED3 depiced in FIGS. 6A-6J (SEQ ID NO. 1-2) was deduced from the complete analysis of pVR318 which was then altered at position 2493 by exchanging a SacI/HindIII fragment from the same subclone (pLL3-271) used to construct pVR318. However, confirmatory sequencing of the entire cDNA sequence in plasmid pLED3 revealed an additional "A" (adenosine) in a region where a run of six A's is normally found in the viral genome (position 4133-4138). The erroneous "A" was also found in subclone pLL3-271. The pLL3-271 and pBr318 were manipulated to make pLED3.2 which contains the correct number of A's at this site.

In order to reconstruct pLED3 with the correct number of A's, the SacI/HindIII fragment from pLL3-271 used above was cloned into bacteriophage M13 for oligonucleotide-directed deletion mutagenesis. An oligonucleotide spanning nucleotides 4121-4150 was synthesized for this purpose. The oligonucleotide has the sequence: 5'-CGCCTCAGTAAATTTTTTCAACCAACTATC-3' (SEQ ID NO. 3).

Mutagenesis was performed using a "T7-GEN In Vitro Mutagenesis Kit" (United States Biochemical, Cleveland, Ohio) and following the manufacturer's directions. The mutagenized insert was called CB2 and was demonstrated to possess six A's at positions 4133-4138 as well as C at 2493 by sequence analysis. The corrected full-length construct was made by ligating the SacI/HindIII fragment of CB2 to the SacI/HindIII partial digestion fragment of pVR318 which had been used in the original pLED3 construction. The product of this ligation was called pLED3.2. The entire cDNA sequence of pLED3.2 has been verified to match the sequence reported in FIGS. 6A-6J.

FIGS. 6A-6J, (SEQ ID NO. 1-2) depict the nucleotide sequence of a true type 3 poliovirus vaccine strain cDNA according to this invention.

FIGS. 7A and 7B. FIG. 7A depicts the structure of the plasmid containing full-length LED3 cDNA and the T7 RNA polymerase promoter. The large shaded area represents the poliovirus cDNA. The small open area indicates the T7 promoter (pT7) and the start site for in vitro positive polarity transcripts of the cDNA. The plasmid is cut with PvuI (sites shown) before run-off transcripts are synthesized. FIG. 7B depicts RNAs transcribed from cDNA clones by purified T7 RNA polymerase. Portions of the transcription reaction mixture were analyzed by electrophoresis in a 0.6% agarose gel as described in Materials and Methods, Example 13. Lanes 1-3 represent 0.75 μg, 0.50 μg and 0.25 μg of 7.5 kb ssRNA marker, respectively. HindIII-digested phage lambda DNA is shown in lane 4. Lanes 5 & 6 demonstrate transcription reactions containing PvuI-digested pLED3 and pVR318 templates, respectively. RNA extracted from pelleted virions is shown in lane 7.

FIG. 8, depicts SDS-polyacrylamide gel electrophoresis of LED3 and VR318 cDNA-derived viruses. A ³⁵ S! methionine-labeled sample was prepared and loaded in each lane and resolved by eletrophoresis as described hereafter. The gel was dried and the protein bands visualized by autoradiography. LED3 virus in lanes "A"; VR318 virus in lanes "B". The positions of prestained molecular mass markers (kilodaltons) are indicated on the left. Positions of viral capsid proteins are identified on the right.

FIGS. 9A-9C, depict plaque phenotype of Leon (wild-type), VR318 and LED3 viruses on Vero cells. After incubation at 33.5° C. for 3 days under 1.0% nutrient agar, cells were stained with neutral red to visualize plaques.

FIG. 10, depicts kinetics of virus growth at 33.5° C. Vero cell monolayers were infected at an MOI of 4, and the extracellular medium was harvested at the indicated times post-infection. Plaque assays were used to determine the titer of infectious particles per ml as described in Materials and Methods, Example 13.

∘------------∘, LED3; Δ -- -- -- -- -- -- Δ, VR318.

FIG. 11, depicts thermal stability of LED3 and VR318 virus at various temperatures. Virus samples containing approx. 10⁷.3 pfu/ml were incubated at 22° C. (circle), 37° C. (square) and 42° C. (triangle). Samples were periodically removed and titer of infectious virus determined by plaque assays. Open markers, LED3; filled markers VR318.

DETAILED DESCRIPTION OF THE INVENTION

This invention provides a method for producing a true RNA virus cDNA comprising the steps of (a) isolating genomic RNA from an RNA source virus; (b) employing RNA sequencing means to determine the nucleotide sequence of a portion of the isolated genomic RNA; (c) employing cDNA synthesis means to produce a double-stranded cDNA from the isolated genomic RNA; (d) employing DNA sequencing means to determine the nucleotide sequence of a portion of the cDNA, wherein the portion of the cDNA corresponds to the portion of the RNA sequenced in step (b); (e) comparing the sequenced cDNA with the sequenced RNA to determine substantive differences in nucleotide sequence; and (f) altering the substantive differences in the cDNA to produce a true RNA virus cDNA.

This invention also provides a method for producing an RNA virus cDNA comprising the steps of a) isolating genomic RNA from an RNA source virus, b) employing RNA sequencing means to determine the nucleotide sequence of a portion of said isolated genomic RNA, c) employing cDNA synthesis means to produce a double-stranded cDNA from said isolated genomic RNA, d) employing DNA sequencing means to determine the nucleotide sequence of a portion of said cDNA, wherein said portion of said cDNA corresponds to said portion of said RNA sequenced in step b), e) comparing said sequenced cDNA with said sequenced RNA to determine substantive differences in nucleotide sequence, and altering said substantive differences in said cDNA to produce an RNA virus cDNA.

This invention also provides the methods described hereinabove, wherein the RNA source virus is a Picornavirus, such as a vaccine strain 3 poliovirus. In one embodiment of the invention, the portion of RNA sequenced in step (b) comprises nucleotide 2493 of a vaccine strain 3 poliovirus. In a further aspect of this embodiment, the RNA consists of about 100 to 200 nucleotides. Alternatively, the nucleotide sequence of the entire isolated viral RNA is determined in step (b).

This invention further provides a true RNA virus cDNA or an RNA virus cDNA produced by the methods described hereinabove. For example, a true RNA virus cDNA or an RNA virus cDNA may be produced which is derived from a vaccine strain 3 poliovirus, the cDNA being selected from that contained in a novel plasmid designated pLED3.2 or pLED3, respectively or cDNAs which code on expression for the polypeptides coded on for expression by pLED3.2 or pLED3, respectively and the recombinant DNA molecule produced thereby.

The recombinant DNA molecule can be operatively linked to a promoter of RNA transcription. Suitable promoters include, but are not limited to the T7 promoter.

A host transformed with a recombinant DNA molecule described hereinabove is also provided by this invention, wherein the host is selected from the group consisting of bacteria, such as E. coli, yeast and other fungi, insect cells and animal cells. Suitable animal cells include, but are not limited to Vero cells, HeLa cells, COS cells, CV-1 cells and primary monkey kidney cells.

Further provided by this invention is a method for producing a viable RNA virus comprising the steps of: (a) culturing a host described hereinabove under conditions which permit the production of viable RNA virus; and (b) harvesting the viable RNA virus from the host cell culture.

This invention provides a method of producing a viable RNA virus comprising the steps of: (a) employing in vitro transcription means to produce RNA from a recombinant DNA molecule described hereinabove (b) isolating the RNA; (c) transfecting a host with the isolated RNA, wherein the host is an animal cell; (d) culturing the host under conditions which permit the production of viable RNA virus; and (e) harvesting the viable RNA virus from the host cell culture. For example, vaccine strain 3 poliovirus may be used and the host may be a primary monkey kidney cell.

This invention further provides a method of producing a viable RNA virus comprising the steps of: (a) transfecting a host with RNA virus cDNA, wherein the cDNA is selected from that contained in the plasmid pLED3 or pLED3.2 or cDNAs which code on expression for the polypeptides coded on for expression by pLED3 or pLED3.2, wherein said host is an animal cell; (b) culturing said host under conditions which permit the production of viable RNA virus from said host cell culture.

This invention further provides an RNA virus, produced by transforming a suitable host cell with a recombinant nucleic acid sequence which encodes for the virus; culturing the host cell under conditions which permit the production of virus; and isolating the virus so produced.

In one embodiment of this invention, the RNA virus may be a vaccine strain 3 poliovirus and the recombinant nucleic acid sequence is a recombinant infectious full length nucleic acid sequence which encodes for the virus; culturing the host cell under conditions which permit the production of virus; and isolating the virus so produced.

In one embodiment of this invention, the RNA virus may be a vaccine strain 3 poliovirus and the recombinant nucleic acid sequence is a recombinant infectious full length nucleic acid sequence which encodes for the vaccine strain 3 poliovirus.

A method of screening for variants of a strain 3 poliovirus is also provided which comprises the steps of: (a) isolating genomic RNA from the poliovirus; (b) employing RNA sequencing means to determine the nucleotide at position 2493.

This invention also provides a method for increasing the attenuation of a strain 3 poliovirus encoded by an RNA virus cDNA, wherein the cDNA comprises the nucleotide sequence ATT at positions 2492 to 2492, the method comprising the step of mutagenizing the cDNA at nucleotide 2493 to change the T to C. In one embodiment of the invention, the method can further comprise the step of mutagenizing the cDNA at nucleotide 2494 to change the T to a nucleotide selected from the group consisting of A, C and G.

This invention provides a vaccine useful for immunizing a subject, for example a human, against infectious poliovirus, wherein the vaccine comprises an effective amount of an RNA virus, produced by transforming a suitable host cell with a recombinant nucleic acid sequence which encodes for the virus; culturing the host cell under conditions which permit the production of virus; and isolating the virus so produced, effective to immunize the subject, and a suitable carrier. The RNA virus may be a vaccine strain 3 poliovirus and the recombinant nucleic acid sequence is a recombinant infectious full length nucleic acid sequence which encodes for the vaccine strain 3 poliovirus.

The recombinant nucleic acid sequence which encodes for the RNA virus may be a DNA sequence, or an RNA sequence or in the preferred embodiment of this invention, the recombinant nucleic acid sequence is a cDNA sequence. Suitable cDNAs are the plasmids designated pLED3.2 or pLED3, each of which contains a promoter linked cDNA nucleic acid sequence which encodes for the vaccine strain 3 poliovirus. The plasmid pLED3 was deposited with the American Type Culture Collection (ATCC), located at 12301 Parklawn Drive, Rockville Md., 20852, under the provisions of the Budapest Treaty on the International Recognition of the Deposit of Microorganisms for the Purposes of Patent Procedure. The plasmid pLED3 was assigned Accession No. 40789. The plasmid pLED3.2 also was deposited with the ATCC under the provisions of the Budapest Treaty and was assigned Accession No. 75062.

Suitable host cells include, but are not limited to bacteria, such as E. coli, yeast and other fungi, insect cells and animal cells.

Suitable animal cells include, but are not limited to Vero cells, HeLa cells, COS cells, CV-1 cells and primary monkey kidney cells.

The suitable carrier may be a physiologically balanced culture medium, such as aline containing stabilizing agents, for example, dextrose and lactose, or other nontoxic substances. These vaccines may also be formulated with a suitable adjuvant such as alum. For methods of vaccine preparation, see J. I. Duffy, Vaccine Preparation Techniques, Noyes Data Corporation (1980), and G. W. Warr, "Preparation of Antigens and Principles of Immunization", in J. J. Marchalonis and G. W. War. eds., Antibody As A Tool--The Applications of Immunochemistry, pp. 21-58, John Wiley & Sons (1982).

The RNA virus may also be desiccated, e.g., by freeze drying for storage or for subsequent formulation into liquid vaccines.

Further provided by this invention is a method of immunizing a subject such as a human against infectious poliovirus, wherein the method comprises administering to the subject a suitable dose of the vaccine described hereinabove. Suitable methods of administering vaccines are well known to those of ordinary skill in the art. However, by way of example, such methods may include but are not limited to intramuscular, intravenous, subcutaneous, intratracheal or intranasal administration.

Additionally, the effective immunizing amount is an amount which is necessary to invoke the production of antibodies by the subject thereby conferring protection on the subject against infectious poliovirus or poliomyelitis.

Throughout this application, references to specific nucleotides in cDNA molecules are to nucleotides present on the coding strand of the cDNA, i.e., the strand which has a sequence equivalent to the position RNA strand of an RNA virus. References to specific nucleotide position numbers in strain 3 poliovirus follow the nucleotide numbering system of G. Stanway et al., "Comparison of the Complete Nucleotide Sequences of the Genomes of the Neurovirulent Poliovirus P3/Leon/37 and its Attenuated Sabin Vaccine Derivative P3/Leon 12alb", Proc. Natl. Acad. Sci. U.S.A., 81, pp 1539-43 (1984). The following standard abbreviations are used throughout the specification and in the claims to indicate specific nucleotides:

C--cytosine

A--adenosine

T--thymidine

G--guanosine

U--uracil

The term "source virus" refers to the RNA virus from which RNA is isolated and used as a template for cDNA production. And the term "increased attenuation" is used throughout to indicate a lower rate of reversion to the neurovirulent phenotype than those of conventional vaccine virus strains.

The term "true RNA virus cDNA", as used herein, refers to a cDNA which directs the production of a viable RNA virus that is phenotypically similar to the source virus. Accordingly, the present invention encompasses cDNA molecules which, by virtue of the redundancy of the genetic code, are characterized by a nucleotide sequence that differs from that of the source virus RNA, but which encode polypeptides having the same amino acid sequences as those encoded by the source virus RNA. The invention also encompasses cDNAs which encode amino acid sequences which differ from those of the source virus polypeptides, but which do not produce phenotypic changes. Hereinafter, these altered, but phenotypically equivalent amino acid sequences are referred to as "equivalent amino acid sequences." And this invention encompasses cDNA molecules characterized by changes in non-coding regions that do not alter the phenotype of the RNA virus produced therefrom when compared to the source virus. Differences between the nucleotide sequence of an RNA virus cDNA and the source virus RNA which result in phenotypical differences in the virus produced therefrom are hereinafter referred to as "substantive differences".

This invention provides a method for producing an RNA virus cDNA comprising the steps of (a) isolating genomic RNA from an RNA source virus, (b) employing RNA sequencing means to determine the nucleotide sequence of a portion of the isolated genomic RNA, (c) employing cDNA synthesis means to produce a double-stranded cDNA from the isolated genomic RNA, (d) employing DNA sequencing means to determine the nucleotide sequence of a portion of the cDNA, wherein the portion of the cDNA corresponds to the portion of the RNA sequenced in step (b), (e) comparing the sequenced cDNA with the sequenced RNA to determine substantive differences in nucleotide sequence, and (f) altering the substantive differences in the cDNA to produce an RNA virus cDNA. In one embodiment of this invention, the RNA virus is a true RNA virus and in another embodiment of this invention the RNA virus is a "phenotypically equivalent" virus. In one embodiment of this invention, the RNA source virus is a Picornavirus, for example a vaccine strain 3 poliovirus.

This invention also provides a method as described hereinabove wherein the portion of RNA sequenced in step (b) comprises nucleotide 2493 of a vaccine strain 3 poliovirus, or preferably, wherein the RNA consists of about 100 to 200 nucleotides.

In another embodiment of this invention, the method described hereinabove wherein the nucleotide sequence of the entire isolated viral RNA is determined in step (b).

The method of this invention may also be used to produce an RNA virus cDNA by altering nucleotides in various regions of the poliovirus genome, for example, to correct mutations introduced into the region coding for the amino terminus of VP1 of the type 1 Mahoney poliovirus K. Kirkegaard, "Mutations in VP1 of Poliovirus Specifically Affect Both Encapsulation and Release of Viral RNA", J. Virology, 64, pp. 195-206 (1990)!.

Further provided by this invention is an RNA virus cDNA produced by any of the methods described hereinabove which may include, but is not limited to an RNA virus cDNA derived from a vaccine strain 3 poliovirus, the cDNA being selected from the group consisting of pLED3 or pLED3.2 and cDNAs which code an expression for the polypeptides coded on for expression by pLED3 or pLED3.2.

This invention also provides a recombinant DNA molecule comprising an RNA virus cDNA as described hereinabove, which may include, but is not limited to the recombinant DNA molecule wherein the RNA virus cDNA is operatively linked to a promoter of RNA transcription such as the T7 promoter.

A host transformed with a recombinant DNA molecule described hereinabove is also provided wherein the host is selected from the group consisting of bacteria, yeast and other fungi, insect cells and animal cells. In a preferred embodiment of this invention, the host is an animal cell and is selected from the group consisting of Vero cells, HeLa cells, COS cells, CV-1 cells and primary monkey kidney cells. In another embodiment of this invention, the host is E. coli.

This invention further provides a method for producing a viable RNA virus comprising the steps of: culturing a host described hereinabove under conditions which permit the production of viable RNA virus and harvesting the viable RNA virus from the host cell culture. This invention also provides a method of producing a viable RNA virus comprising the steps of employing in vitro transcription means to produce RNA from a recombinant DNA molecule described hereinabove, isolating the RNA, transfecting a host with the isolated RNA, wherein the host is an animal cell, culturing the host under conditions which permit the production of viable RNA virus and harvesting the viable RNA virus from the host cell culture. In the preferred embodiment of this invention, the RNA virus is a vaccine strain 3 poliovirus and the host is a primary monkey kidney cell.

A method of screening for variants of a strain 3 poliovirus is provided by this invention which comprises the steps of isolating genomic RNA from the poliovirus, and employing RNA sequencing means to determine the nucleotide at position 2493.

This invention also provides a method for increasing the attenuation of a strain 3 poliovirus encoded by an RNA virus cDNA, wherein the cDNA comprises the nucleotide sequence ATT at positions 2492 to 2494, and the method comprising the step of mutagenizing the cDNA at nucleotide 2493 to change the T to C. This method may further comprise the step of mutagenizing the cDNA at nucleotide 2494 to change the T to a nucleotide selected from the group consisting of A, C and G.

The determination of phenotypic differences may be carried out by several methods which are well known in the art. Preferably, a true RNA virus cDNA of an attenuated strain 3 poliovirus encodes a virus which has the same degree of attenuation as the source virus. Several well characterized markers can be used to determine phenotypic changes in a strain of poliovirus. These are the "d" markers, which regulate the ability of the virus to grow under acid conditions M. Vogt et al., "Mutants of Poliomyelitis Viruses with Reduced Efficiency of Plating in Acid Medium and Reduced Neuropathogenicity", Virology, 4, pp. 141-55 (1957)!; and the "rct₄₀ " marker, which regulates the ability of the virus to grow at elevated temperatures A. Lwoff, "Factors Influencing the Evolution of Viral Diseases at the Cellular Level and in the Organism", Bact. Rev., 23, pp. 109-24 (1959)!.

The most preferred method of determining phenotypic changes between an attenuated strain 3 source poliovirus and the virus produced from its true RNA virus cDNA is a comparison of neurovirulence. Several protocols for assessing neurovirulence are known in the art Code of Federal Regulations, Title 21, Chapter 1, pp. 91-93 (Apr. 1, 1987 edition)!.

Production of a True RNA Virus cDNA

According to one embodiment, the present invention relates to a method for producing true RNA virus cDNA. The production of a true RNA virus cDNA according to this invention may employ any RNA source virus--positive single-stranded, negative single-stranded, or double-stranded. Preferably, the source virus is a positive single-stranded virus. More preferably, the virus is a human enterovirus belonging to the family Picornaviridae. Most preferred is an attenuated type 3 vaccine strain poliovirus (also referred to herein as "P3/Sabin").

A. Proliferation and Isolation of Virus

The initial step in the production of true RNA virus cDNA involves the proliferation and purification of the source virus and the isolation of RNA therefrom. Techniques for proliferating and isolating virus are well known in the art R. J. Kuchler, "Biochemical Methods in Cell Culture and Virology", Dowden, Hutchinson and Ross, Inc., Stroudsburg, Pa. (1977)!. It will be understood that the method of viral growth, including choice of host cell, selection of growth medium, and conditions of growth, will differ depending upon the particular source virus. In a preferred embodiment, attenuated strain 3 poliovirus is proliferated in primary monkey kidney cells according to known methods A. Sabin et al., "Studies On Variants Of Poliomyelitis Virus", J. Exp. Med., 99, pp. 551-76 (1954)!. The following procedures are applicable for any strain of poliovirus. However, the use of an alternate RNA source virus in these procedures may require certain virus-specific modifications which are known to those of skill in the art.

Viral proliferation of the RNA virus is allowed to proceed to a point where a sufficient quantity of virus can be harvested for RNA isolation. The culture media containing the source virus is then collected and cellular debris is removed, preferably by centrifugation at a speed which will not pellet the virus. This is typically about 2,500 rpm for about 20 minutes. The supernatant may then be further purified by ultrafiltration employing a filter having a pore size that is larger than the viral particles. Preferably, a filter of approximately 0.22 μM is used.

Following filtration, the viral particles are collected by polyethylene glycol precipitation followed by centrifugation or, more preferably, by high speed centrifugation at about 70,000 rpm. The viral particles are then resuspended in a small volume of buffer, preferably TNE (10 mM Tris-HCl, 100 mM NaCl, 1 mM EDTA, pH 7.4). A non-ionic detergent may optionally be added to the viral particle suspension to dissolve any contaminants. Although the high speed viral pellet is sufficiently pure to use as a source of viral RNA the viral suspension may optionally be further purified by sucrose density gradient centrifugation.

If density gradient centrifugation is employed, fractions are collected from the gradient and analyzed for the presence of source virus. Any conventional assay which detects source virus-specific proteins may be employed. Such assays include, for example, Western blots, ELISA, radioimmunoassay, or polyacrylamide gel electrophoresis and comparison to a source virus standard. The latter technique is most preferred because it is the most economical.

B. Isolation and Sequencing of Viral RNA

Once the virus is purified as described above, viral RNA may then be isolated. This is achieved by first dissociating the viral capsid proteins by treatment with detergent, preferably sodium dodecyl sulfate ("SDS") at a final concentration of 0.5%. The dissociated proteins are then extracted by treatment of the sample with organic solvents. Extraction is preferably achieved with a phenol:chloroform:isoamyl alcohol mixture. The RNA present in the aqueous phase may then be isolated by any method well known in the art T. Maniatis, "The Molecular Guide To Cloning", Cold Spring Harbor Press (1983)!. Preferably, the viral RNA is precipitated with 0.5 volumes of 7.5 M ammonium acetate and 2.5 volumes of ethanol at -20° C. Quantitation and integrity of the viral RNA may be determined by agarose gel electrophoresis. It should be noted, as is well known in the molecular biology art, that great care must be taken in the preparation and handling of RNA samples due to the prevalence of RNases. Methods for inactivating RNases that may be present in reagents and in vessels used in RNA preparation are well known T. Maniatis, supra!.

Once the source virus RNA has been isolated, it is subjected to dideoxy nucleotide sequencing F. Sanger et al., "DNA Sequencing With Chain-Terminating Inhibitors", Proc. Natl. Acad. Sci. USA, 74: 5463-67 (1977)! employing modifications for RNA D. C. Deborde et al., "Resolution Of A Common RNA Sequencing Ambiguity By Terminal Deoxynucleotidyl Transferase", Anal. Biochem., 157, pp. 275-82 (1986)!. According to a preferred embodiment of this invention, the entire RNA genome of the virus is sequenced and compared to the cDNA sequence by methods which are hereinafter described. In this embodiment of the invention the source virus is most preferably an attenuated type 3 vaccine strain poliovirus.

C. Synthesis and Screening of an RNA Virus cDNA Library

Following RNA sequencing, the source virus RNA is used as a template for the synthesis of a full-length, double-stranded cDNA. Any well-known method or commercially available cDNA synthesis kit is employed to synthesize cDNA. Preferably, cDNA is synthesized by the method of V. R. Racaniello et al., "Molecular Cloning Of Poliovirus cDNA And Determination Of The Complete Nucleotide Sequence Of The Viral Genome", Proc. Natl. Acad. Sci. USA, 78, pp. 4887-91 (1981). For ease of detection, the first strands of cDNA may optionally be radiolabeled by employing a radioactive nucleotide during cDNA synthesis. Once synthesized, the single-stranded cDNA are preferably size-fractionated either by agarose gel electrophoresis or, more preferably, by gel chromatography. The larger cDNAs are isolated and used as templates for second-strand synthesis. Double-stranded cDNA is then size-fractionated as described above and the largest molecules are used for the creation of a source virus cDNA library. The cDNA are then tailed, either by the olio dG/dC method or by the addition of restriction enzyme linkers, and cloned into an appropriate vector. The choice of vector will be based upon the technique that will be employed to screen the library. For example, the use of an immunoscreening technique requires that the cDNA be inserted into an expression vector, such as lambda gt11 (ATCC accession number 37194). If a hybridization screening method is employed, vectors such as bacterial plasmids are most convenient. Preferably, the cDNA are tailed by the olio dG/dC method and cloned into the PstI site of pBR322.

Once the RNA virus cDNA library is created, it is screened o for a full-length cDNA clone. This may be achieved by well-known screening methods, such as antibody screening or hybridization to a labeled probe. Most preferably, the library is screened by colony hybridization using virus-specific cDNA probes M. Grunstein et al., "Colony Hybridization: A Method for the Isolation of Cloned DNAs That Contain A Specific Gene", Proc. Natl. Acad. Sci. USA, 72, pp. 3961-65 (1975)!, based on known nucleotide sequences or amino acid sequences of the virus. Once a clone containing an RNA virus cDNA has been identified and isolated, it may be removed from the vector and analyzed to determine whether it represents a full-length RNA virus cDNA. In a preferred embodiment of the invention, the dC-tailed cDNA is removed from a Pst I cut, dG-tailed vector by digestion with PstI. Partial cDNAs may be used to reprobe the library and to locate longer, or full-length cDNAs. If no full-length cDNAs can be detected, several overlapping partial cDNAs representing the entire source virus genome may be ligated together at common restriction sites to produce a full-length cDNA V. R. Racaniello et al., "Cloned Poliovirus cDNA Is Infections In Mammalian Cells", Science, 214, pp. 916-19 (1981)!. Any portion of the viral genome which is not represented by an isolated cDNA may be synthesized using standard oligonucleotide synthesizing techniques and subsequently ligated into its proper position to form a full-length cDNA.

D. Sequencing and Alteration of cDNA to Correspond to Source Virus RNA

Portions of the full-length cDNA corresponding to the sequenced portion of the source virus RNA are then sequenced by standard DNA sequencing methods. The sequenced regions of the viral RNA and the RNA virus cDNA are then compared. Theoretically, the cDNA should correspond exactly to the RNA which served as its template. However, it is known that reverse transcriptase can produce errors when transcribing cDNA from RNA I. M. Verma, "Reverse Transcriptase", In The Enzymes, Vol. 14, P. D. Boyer, ed., Academic Press, New York, pp. 87-104 (1981)!. Therefore, according to the method of this invention, it is necessary to alter the nucleotide sequence of the cDNA so that it corresponds to the sequenced RNA.

The present invention contemplates altering the cDNA sequence at those sites which are responsible for phenotypic changes. Accordingly, portions of the cDNA which encode polypeptides having equivalent amino acid sequences as those encoded by the source virus RNA need not be altered.

Preferably, any cDNA nucleotide mutation which may potentially affect virus production or viral polypeptide synthesis should be altered to correspond to the source virus RNA.

Methods for altering the nucleotide sequence of a cDNA molecule are known in the art and include site-directed mutagenesis C. A. Hutchinson, III et al., "Mutagenesis At A Specific Position In A DNA Sequence", J. Bio. Chem., 253, pp. 6551-60 (1978); A. Razin et al., "Efficient Correction Of A Mutation By Use Of A Chemically Synthesized DNA", Proc. Natl. Acad. Sci. USA, 75, p. 4268 (1978)!. Alternatively, a partial cDNA clone containing the desired sequence may be isolated from the cDNA library and its DNA, or a portion thereof, substituted in the full-length clone for the sequences which are to be altered. Once the cDNA sequence has been altered, it is utilized in other embodiments of this invention.

According to another embodiment of the present invention, the true RNA virus cDNA may be inserted into an appropriate vector and used to transform an appropriate host. The choice of vector will depend upon the ultimate intended use of the cDNA. Similarly, the choice of host will depend upon both the vector selected and the ultimate goal of transformation.

For example, if it is desirable to simply store the cDNA and create and unlimited supply thereof, the cDNA will be inserted into a vector which is capable of transforming a unicellular organism, such as a bacteria, yeast or other fungi, an animal cell or an insect cell. According to one embodiment of this invention, the cDNA is inserted into the PstI site of pBR322 and the host to be transformed is E. coli.

According to another embodiment of the invention, the true RNA virus cDNA may be operatively linked to a promoter of transcription. As used herein, the term "operatively linked" means positioned in such a manner that the promoter will direct the transcription of RNA off of the true virus cDNA. Examples of such promoters are SP6, T4 and T7. The most preferred promoter is the T7 promoter J. J. Dunn et al., "Complete Nucleotide Sequence of Bacteriophage T7 DNA and the Locations of T7 Genetic Elements", J. Mol. Biol., 166, pp. 477-535 (1983)!. Vectors which contain both a promoter and a cloning site into which an inserted piece of DNA is operatively linked to that promoter are well known in the art. Preferably, these vectors are capable of transcribing RNA in vitro. Examples of such vectors are the pGEM series Promega Biotec, Madison, Wis.!.

According to a further embodiment, the present invention relates to methods for producing viable positive stranded RNA virus. This may be achieved by transfecting an appropriate host with a true RNA virus cDNA. Any standard method of transfecting animal cells with DNA may be employed F. M. Ausubel et al., "Current Protocols in Molecular Biology", Greene Publishing Associates & Wiley Intersciences (1987)!. The host is then cultured under conditions conductive to the production of RNA virus. Such conditions are well known in the art and will vary depending upon the virus to be produced. Similarly, the choice of host cell should be one which is compatible with the virus, preferably primate cells in culture. In a preferred embodiment of the invention, wherein the true RNA virus cDNA encodes an attenuated strain 3 poliovirus, the host is selected from the group consisting of Vero cells, HeLa cells, COS cells, CV-1 cells, human diploid cell lines, such as WI-38 and MRC5, and primary monkey kidney cells. The most preferred hosts are monkey kidney cells. Once the cells have produced a desirable level of virus, the virus is harvested from the cell culture according to standard protocols.

According to an alternative embodiment of this invention, viral RNA, which is produced by in vitro transcription of a true RNA virus cDNA according to the invention, may be employed in methods for producing viable RNA virus. The in vitro transcribed RNA is isolated by standard methods and used to transfect an appropriate host. The use of RNA to transfect cells is known in the art S. van der Werf et al., "Synthesis of Infectious Poliovirus RNA by Purified T7 RNA Polymerase", Proc. Natl. Acad. Sci. USA, 83, pp. 2330-34 (1986)!. Once the cells are transfected, they are grown and the virus harvested according to standard protocols.

According to another embodiment, this invention relates to a method of screening for variants of a strain 3 poliovirus. Through RNA sequencing, it was discovered that the presence of a C at nucleotide position 2493 of this virus is linked to the attenuated strain 3 genotype. Previous analyses of this strain failed to recognize the importance of this position in attenuation due to a combination of two factors: the sequences of wild-type and attenuated strain 3 poliovirus were compared on the cDNA level, rather than the genomic RNA level; and some of these cDNAs were made from plaque isolates of the original viral samples Stanway et al., "Comparison of the Complete Nucleotide Sequences of the Genomes of the Neurovirulent Poliovirus P3/Leon/37 and its Attenuated Sabin Vaccine Derivative P3/Leon 12alb", Proc. Natl. Acad. Sci. USA, 81, pp. 1539-43 (1984); H. Toyoda et al., "Complete Nucleotide Sequences of All Three Poliovirus Serotype Genomes", J. Mol. Biol., 174, pp. 561-85 (1984)!. As a result of either errors in reverse transcription or mutations resulting from virus passaging, the cDNAs previously produced contained a T at position 2493. Therefore, the RNA of type 3 vaccine strain of poliovirus was mistakenly believed to contain U at this position, the same nucleotide present in the wild-type strain Stanway et al., supra; H. Toyoda et al., supra!. Accordingly, position 2493 was never thought to contribute to the attenuation of the poliovirus strain 3 genome.

Therefore, the method of screening for variants of a strain 3 poliovirus according to this invention comprises the steps of sequencing the RNA of the virus and determining the nucleotide at position 2493. Most preferably, the portion of RNA to be sequenced consists of about 100-200 nucleotides flanking nucleotide 2493. This method may be used during amplification of the source virus (e.g., in vaccine production) to ensure maintenance of C at position 2493 in the viral genome.

This invention also relates to a method for increasing the attenuation of a strain 3 poliovirus produced from a true RNA virus cDNA, wherein the cDNA comprises the sequence ATT at positions 2492-2494. The method comprises mutagenizing the nucleotide at position 2493 from a T to a C and subsequently using the mutagenized cDNA to produce viable virus. The presence of a C instead of a U at position 2493 of strain 3 poliovirus RNA would be expected to alter the sixth amino acid of the viral capsid protein, VP1, from isoleucine to threonine, based on the genetic code. The condon encoding this amino acid spans nucleotides 2492-2494. Therefore, the method for increasing the attenuation of strain 3 poliovirus according to this invention may also include mutagenizing nucleotide 2494 from a T to either A, C or G.

In order that the invention described herein may be more fully understood, the following examples are set forth. It should be understood that these examples are for illustrative purposes only and are not to be construed as limiting this invention in any manner.

EXAMPLE 1

Purification Of A Strain 3 Poliovirus

All glassware utilized in the methods described below is either sterilized or treated with diethyl pyrocarbonate (DEP) to destroy RNases. All reagents are made up with water that had been treated with DEP prior to use.

Primary monkey kidney cells are infected with an attenuated strain 3 poliovirus, isolated by plaque purification from the "ORIMUNE" vaccine (Lederle Laboratories, Pearl River, N.Y.), at a low multiplicity of infection ("MOI"). The infected cultures are maintained at 34° C. in modified Earle's lacteal maintenance medium, pH 7.3, until the cell monolayer is destroyed (+4 cytopathic effect ("CPE")). The culture media (120 ml) is collected and centrifuged at 2,500 rpm for 20 minutes to remove any cellular debris. The supernatant is then filtered through a 0.22 μM Millex GV disc filter. The filtrate is then placed in quick seal high speed centrifuge tubes and spun in a 70.1 Ti rotor at 70,000 rpm for one hour at 4° C.

The virus pellet is resuspended in 4 ml of RNase-free TNE (10 mM Tris-HCl, 100 mM NaCl, 1 mM EDTA, pH 7.4) and the suspension transferred to a 15 ml polypropylene tube. The viral capsid are then dissociated by the addition of RNase-free SDS to a final concentration of 0.5%.

EXAMPLE 2

Isolation And Sequencing Of Viral RNA

Viral RNA is then isolated by extracting the dissolved capsid as prepared in Example 1, with an equal volume of phenol:chloroform:isoamyl alcohol (25:24:1). The aqueous layer is removed and re-extracted with the same organic solution. One-half volume of 7.5M ammonium acetate and 2.5 volumes of 100% ethanol is added to the aqueous extract and the RNA is precipitated at -20° C. for at least 30 minutes. The RNA is then pelleted by centrifugation at 12,000 rpm for 30 minutes. The RNA pellet is washed twice with ice-cold 70% ethanol, dried under vacuum and resuspended in 20 μM of water. The integrity and concentration of the RNA is estimated by using agarose gel electrophoresis.

The viral RNA is then sequenced essentially by the method of DeBorde D. C. Deborde et al., Anal. Biochem., 157, pp. 275-82 (1986)!, the disclosure of which is incorporated here by reference. The specific details of sequencing are described below.

Approximately 500 mg of purified viral RNA is heat denatured at 100° C. for 3 minutes in 200 mM Tris-HCl, pH 8.3, 200 mM KCl, 20 mM MgCl₂, 10 mM DTT and then quick-chilled by immersion into an ice bath. Primer, dATP and enzymes are then mixed in with the RNA, as described by DeBorde et al. Two and one-half μl of primer-RNA mix is then combined with an equal volume of various reaction mixes, in separate tubes, to give the following component concentrations:

Tube A: 50 mM Tris-HCl, pH 8.3, 50 mM KCl, 5 mM MgCl, 10 mM DTT, 100 μM each of dCTP, dGTP and dTTP, 5 μCi ³⁵ S!-dATP, 1.25 μM ddATP, 100 ng RNA, 15 ng primer and 2.8 units reverse transcriptase;

Tube C: same as tube A, except 20 μM dCTP, 2.5 μM ddCTP and no ddATP;

Tube G: same as tube A, except 20 μM dGTP, 3.0 μM ddGTP and no ddATP;

Tube T: same as tube A, except 20 μM dTTP, 7.5μM ddTTP and no ddATP;

Tube N: same as tube A, except no ddATP.

Each tube is incubated at 42° C. for 20 minutes. Following this incubation, 1 μl of chase solution (1 mM each of dATP, dCTP, dGTP and dTTP and 2 units of terminal deoxynucleotidyl transferase) is added to each tube and the tubes incubated for another 30 minutes at 37° C. The reactions are stopped by freezing at -20° C. Prior to electrophoresis, 5 μl of formamide dye mixture is added to each tube. The samples are then heated to 100° C. for 3 minutes and 5 μl of each sample is loaded per gel lane.

The sequencing gels (35 cm×13 cm×0.1 mm) are 6% polyacrylamide (38:2; acrylamide:bis-acrylamide) containing 7M urea in TBE (89 mM Tris, 89 mM boric acid, 2 mM EDTA).

When the complete sequence of the attenuated strain 3 poliovirus RNA genome is compared to the published P3/Sabin cDNA sequence Stanway et al., Proc. Natl. Acad. Sci. USA, 81, pp. 1539-43 (1984)!, two nucleotide differences are observed, one of which causes an amino acid change. These are shown in the following table:

    ______________________________________                                         Nucleotide                                                                               P3/Sabin   P3/Sabin  Amino Acid                                      Position  Stanway    RNA       Change                                          ______________________________________                                         2493      T          C         Ile to Thr                                                                     (VPI)                                           6061      C          U(T)      silent                                          ______________________________________                                    

The difference observed at position 2493 is important for several reasons. First, it encodes a significant amino acid change in the viral capsid protein VP1. Moreover, because wild-type strain 3 poliovirus has a U at this position (T in the cDNA), the purported presence of a T at 2493 of the attenuated strain 3 poliovirus cDNA may have obscured a significant difference between the wild-type and attenuated genomes.

EXAMPLE 3

Synthesis Of A Full-Length Poliovirus cDNA

The viral RNA obtained in the previous Example is used as a template for the synthesis of a double-stranded cDNA using the method of V. R. Racaniello et al., "Molecular Cloning Of Poliovirus cDNA And Determination Of The Complete Nucleotide Sequence Of The Viral Genome", Proc. Natl. Acad. Sci. USA, 78, pp. 4887-91 (1981). Specifically, 2.5 μg of viral RNA is used to produce first strand cDNA in a 100 μl reaction containing 50 mM Tris-HCl, pH 8.3, 10 mM MgCl₂, 50 mM KCl, 0.5 mM each of dATP, dTTP, dGTP and dCTP, 0.4 mM DTE, 30 μg/ml olio dT (12-18 nucleotides in length), 4 mM sodium pyrophosphate, 10 μCi/μl α-³² P!-dATP, 1 unit/μl RNasin and 2 units/μl reverse transcriptase Boehringer-Mannheim, Indianapolis, Ind.!. The reaction is incubated at 42° C. for 60 minutes. The reaction is stopped by the addition of EDTA to a final concentration of 50 μM. The solution is then phenol extracted and the aqueous layer applied to a 5×0.7 cm Sepharose CL-4B column. The column is developed with 0.3M NaCl, 10 mM Tris-HCl, pH 8.0, 1mM EDTA. The profile thus obtained is depicted in FIG. 1.

Fractions 5-13 are pooled from the column and 20 μg of glycogen is added thereto. The cDNA is then precipitated at -20° C. by the addition of 2.5 volumes of ethanol, pelleted by centrifugation and resuspended in 25 μl of 10 mM Tris-HCl, pH 8.0, 1 mM EDTA (TE) for use in second strand synthesis.

Second strand synthesis is performed in a 100 μl reaction containing the single-stranded cDNA, 20 mM Tris-HCl, pH 7.4, 7 mM MgCl₂, 0.1M KCl, 50 μg/ml bovine serum albumin (BSA), 0.1 mM of each dNTP, 150 μM β-NAD, 5 μg/ml E. coli DNA ligase, 250 units/ml E. coli DNA polymerase I and 90 units/ml RNase H. The mixture is incubated at 15° C. for 60 minutes followed by 90 minutes at room temperature. The cDNA is then phenol extracted and chromatographed over Sepharose CL-4B as described above. The void volume fractions are pooled, precipitated with ethanol and resuspended in 25 μl TE, as described previously.

The resulting double-stranded cDNA is tailed with deoxycytidine (dC) in a 100 μl reaction containing 140 mM K-cacodylate, pH 7.2, 30 mM Tris-HCl, 1 mM CoCl₂, 1 mM DTT, 50 μg/ml BSA, 150 μM dCTP and 800 units/ml terminal deoxynucleotidyl transferase. The reaction is incubated at 37° C. for 60 minutes and the solution is then phenol extracted. The aqueous layer is removed, ether extracted and the tailed cDNA is precipitated therefrom with ethanol containing 20 μg glycogen. The resulting pellet is suspended in 50 μl of 10 mM Tris-HCl, pH 7.5, 100 mM NaCl, 1 mM EDTA (NTE).

EXAMPLE 4

Production Of A Poliovirus cDNA Library

Increasing amounts (1, 2, 5, 10 μl) of dc-tailed, double-stranded cDNA, produced as in Example 3, are annealed to 10 ng of PstI-cleaved, dG-tailed pUC9 (Pharmacia, Piscataway, N.J.) in NTE by heating the mixture to 68° C. in a water bath for 5 minutes, cooling the mixture in a 42° C. water bath and then slowly decreasing the temperature to room temperature overnight by shutting off the water bath. This allows for optimal hybridization between the dC tails on the cDNA and the dG tails on pUC9. The annealed mixtures are then used to transform E. coli DH5α cells (Bethesda Research Labs, Gaithersburg, Md.) using standard procedures. Ampicillin-resistant colonies are selected for screening.

EXAMPLE 5

Isolation Of Poliovirus Clones

Bacterial colonies are picked onto gridded plates and screened by colony hybridization, using linearized plasmid pOLIO (Sabin) as a probe J. W. Almond et al., "Attenuation and Reversion to Neurovirulence of the Sabin Poliovirus Type-3 Vaccine", In Vaccines, 85, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y., pp. 271-77 (1985)!. Plasmid pOLIO (Sabin) is a plasmid containing a full-length cDNA derived from P3/Leon 12a₁ b G. Stanway et al., Proc. Natl. Acad. Sci. USA, 81, pp. 1539-43 (1984)!. Of the 600 colonies screened by hybridization, 140 give positive hybridization signals. Small cultures (5 ml; LB media+ampicillin) of each of these positive clones are prepared and the plasmid DNA is isolated by the rapid boiling method T. Maniatis et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory (1982)!. The isolated plasmids are cleaved with PstI, which is predicted to excise the cloned insert. Of the 140 clones which are analyzed in this way, the vast majority contain either small inserts (<1 kb) or do not release inserts upon cleavage with Pst1. We are, however, able to identify and map three cDNA clones to the P3/Sabin genome (pLL3-51, 69 and 82; see FIG. 2). Eight hundred colonies are screened by colony hybridization using anEcoRI fragment of pOLIO(Sabin) cDNA, which contains the 5'-most 747 nucleotides, as a probe. Of the thirty-three colonies that hybridize to the probe, five of the cDNAs represent nucleotides 85-779 of P2/Sabin. The origin of these cDNA clones is not known.

Subsequently, additional cDNA is annealed to pUC9 and transformed into DH5α cells. Eight hundred colonies are screened by colony hybridization using full-length pOLIO (Sabin) cDNA as a probe. Twenty-nine positive clones are analyzed by restriction enzyme cleavage. Eight of the positive clones that contain cDNA inserts are subjected to nucleotide sequencing from either end of the inserted DNA. Five of these clones (pLL3-239, 251, 253, 254 and 255) contains a cDNA that maps to the 3'-end of the P3/Sabin genome (FIG. 2).

As depicted in FIG. 2 (open bars), the 8 isolated and analyzed cDNA clones represent nucleotidase 3630 through the 3'-end of the P3/Sabin viral RNA genome.

To obtain additional 5'-cDNA clones, an oligonucleotide complementary to a stretch of bases from nucleotide 4317-4334 (see black shading in FIG. 2, pLL3-253) is synthesized. This oligonucleotide is used to prime cDNA synthesis on P3/Sabin viral RNA isolated according to Example 2 using cDNA synthesis methods described in Example 3. The resulting cDNAs are dc-tailed and cloned into PstI-cut, dG-tailed pUC9. The resulting plasmids are then used to transform DH5α cells. Transformants are screened by hybridization to pOLIO (Sabin 3). Eleven positive cDNA clones are isolated, analyzed by restriction enzyme digestion and sequenced (FIG. 2, black boxes). The eleven clones correspond to nucleotides 6 through 4329 of the P3/Sabin viral RNA.

Although cDNA clones representing all but the first 5 nucleotides of the P3/Sabin genome are identified, attempts to ligate together several of the 5'-cDNAs into a single cDNA proved difficult, due to the lack of convenient restriction sites. Therefore, one final round of cDNA cloning is undertaken in an attempt to obtain a single cDNA clone that represented approximately nucleotides 1 through 1900.

An oligonucleotide complementary to bases 1904-1922 is synthesized and used to prime cDNA synthesis on P3/Sabin RNA. The cDNA is then dc-tailed, inserted into dG-tailed, PstI-cut pUC9 and the resulting plasmid is used to transform DH5α cells. The transformants are again screened with pOLIO (Sabin 3). One of the resulting cDNA clones is identified as containing nucleotides 9-1900 of the viral RNA (FIG. 2, pLL3-307). This completes the cDNA cloning of the P3/Sabin genome.

EXAMPLE 6

Construction Of A Full-Length Poliovirus cDNA

The strategy for assembling the P3/Sabin cDNAs into a single full length cDNA is depicted in FIG. 3. First, clones pLL3-253 and pLL3-271 are assembled into a single clone by using a common HindIII site at nucleotide 4241. The resulting cDNA clone, which represents nucleotides 1500 through the 3' poly(A) of the genome, is called pLL3-I.

Next, the first 5 nucleotides of the P3/Sabin viral genome, which are missing from cDNA pLL3-277, are synthesized. This is achieved by synthesizing two complementary oligonucleotides using an automated oligonucleotide synthesizer and then annealing the oligonucleotides to one another. The resulting double-stranded oligonucleotide comprising from 5' to 3', a PstI site, nucleotides 1 through 34 of P3/Sabin cDNA and a BgeI sticky end, is then used to replace the 5'-most PstI/BglI fragment of pLL3-277. This results in a cDNA clone that represents nucleotides 1-1085. This clone is called pLL3-II.

The final step is the ligation of three SacI fragments representing nucleotides 1-747, 747-1895 and 1895 through the 3'-end, to form a full length P3/Sabin cDNA (termed pLL3-FL). This construction is performed in several separate steps using appropriate cDNA inserts from pLL3-II, pLL3-307 and pLL3-I.

EXAMPLE 7

Sequence Analysis Of The P3/Sabin cDNA

Sequence analysis of P3/Sabin cDNAs used to construct the full length cDNA is performed using three approaches. In the first approach, cDNAs are subcloned into M13 vectors. Nested deletions of the subcloned cDNA, using exonuclease III and mung bean nuclease E. Ozkaynak et al., "A Unidirectional Deletion Technique for the Generation of Clones for Sequencing, Biotechniques, 5, pp. 770-73 (1987)!, are then isolated and their nucleotide sequences determined.

In the second approach, cDNAs are isolated and digested with the RsaI. The resulting fragments are then "shotgunned" into M13 for sequence analysis. The last approach utilizes cDNA fragments that are isolated from clones and subcloned into M13 for sequence analysis.

Upon nucleotide sequencing of the entire P3/Sabin cDNA clone, 3 differences are found when the cDNA sequence is compared to the RNA sequence. These differences are summarized in the following table:

    ______________________________________                                         Nucleotide                                                                               P3/Sabin   P3/Sabin  Amino Acid                                      Position  cDNA       RNA       Change                                          ______________________________________                                         198       G          A         5' untranslated                                 4466      T          C         His to Tyr (2C)                                 6334      T          C         silent                                          ______________________________________                                    

The nucleotide error at position 198 in the cDNA is situated in the 5' untranslated region of the genome. The function of that region remains unknown. The presence of a T, not a C, in the cDNA at position 4466 would create a change in residue 118 of the 2C protein from histidine to tyrosine. Although the function of the 2C protein remains undefined, the predicted amino acid change appears to be significant. The nucleotide change at position 6334 is in the polymerase gene, but is silent with respect to amino acid change.

In order to obtain a full-length cDNA clone which corresponds exactly to the poliovirus RNA sequence, the protocol depicted in FIG. 4, panels A and B is followed. Specifically, the P3/Sabin cDNA insert pLL3-II is subcloned into the PstI site of bacteriophage M13 for oligonucleotide-directed mutagenesis of nucleotide 198 from G to A as described below. An oligonucleotide spanning nucleotides 190-206 and containing an A at position 198 is synthesized for this purpose. The oligonucleotide has the sequence: 3'-GGCGTATCTGACAAGGG-5'.

Mutagenesis is performed using a "T7-GEN In Vitro Mutagenesis Kit" (United States Biochemical, Cleveland, Ohio) and following the manufacturer's directions. The mutagenized insert is called pLL3-II(A). Following mutagenesis, pLL3-II(A) is sequenced to confirm the presence of an A at position 198. pLL3-II(A) is then removed from M13 with PstI and subsequently digested with SacI, which cuts at nucleotide 747. The fragment spanning nucleotides 1-747 is used to replace the corresponding fragment in pLL3-FL. The resulting full-length cDNA containing the mutagenized nucleotide at position 198 is referred to as pLL3-FL(A).

Next, pLL3-FL(A) is digested with PstI and the fragment spanning nucleotide 1 to 2604 is isolated. A portion of the isolated PstI fragment of pLL3-FL(A)--nucleotides 1 to 1922--is then subjected to polymerase chain reaction (PCR) using the following two oligonucleotides as primers:

1) 5'-CTGCAGTAATACGACTCACTATAGGTTAAAACAGCTCTGGGGTTG-3'; and

2) 5'-GAATCATGGTGTCTATCTC-3'.

Oligonucleotide 1 represents, in a 5'-to-3' orientation, a PstI site, the T7 promoter and nucleotides 1-20 of the positive strand of P3/Sabin cDNA. Oligonucleotide 2 represents nucleotides 1904-1922 of the negative strand of P3/Sabin cDNA. PCR is performed using the GeneAMp kit from Perkin-Elmer Cetus.

The double-stranded DNA produced by PCR is then cleaved with EcoRI, which cuts at nucleotide 784 of P3/Sabin cDNA. The resulting 0.8 kb fragment representing the T7 promoter and the 5' end of P3/Sabin cDNA is isolated and subcloned into the SspI/EcoRI-cut pBR322. The resulting plasmid, termed pVR309, is confirmed to contain the T7 promoter and nucleotides 1-784 of P3/Sabin cDNA.

Next, pLL3-FL is cut with EcoRI, which cuts at nucleotides 784 and 2867. The fragment spanning these nucleotides is isolated and ligated to EcoRI cut PVR309. The resulting plasmid, pVR312, contains the first 2867 nucleotides of P3/Sabin cDNA.

In order to change the nucleotides at 4466 and 6334 to correspond to the viral RNA sequence, oligonucleotide-directed mutagenesis is performed as described above. The insert pLL3-253 is subcloned into the PstI site of M13. Oligonucleotide 3'-CTCGTTTGTGGCATAAC-5' is used to change nucleotide 4466 from a T to a C. Oligonucleotide 3'-CCCATGGGGATGCACCG-5' is used to change nucleotide 6334 from a T to a C. The mutagenized nucleotides are confirmed by nucleotide sequencing. The resulting corrected cDNA fragment termed pLL3-253(C).

Next, pLL3-253(C) and pLL3-271 are assembled into a single clone by cleaving each with HindIII and subsequently ligating the resulting large fragments together. The resulting DNA, referred to as pLL3-I(C), represents nucleotides 1500 through the 3-poly(A) end of P3/Sabin. The complete cDNA insert of pLL3-I(C) is then isolated by partial PstI digestion. A partial digest is necessitated because of the presence of a PstI site at nucleotide 2604. pLL3-I(C) is then ligated to PstI cut pBR322. A portion of that insert is then removed from pBR322 by a SmaI/NruI digest. The excised piece spans nucleotide 2766 through the 3'-end of the cDNA and into pBR322. A SmaI/ NruI fragment is also excised from pVR312. This DNA represents sequences from the NruI site in pBR322 through the T7 promoter and the 5'-end of P3/Sabin cDNA and up to the SmaI site at nucleotide 2766. These two SmaI/NruI fragments are then ligated together to form a true P3/Sabin cDNA. The resulting plasmid is called pVR318. The presence of a full-length insert in pVR318 is confirmed by restriction enzyme analysis. However, upon sequencing of pVR318 it was determined that position 2493 was found to have a T, rather than a C.

In order to convert nucleotide 2493 to a C, a SacI/HindIII restriction fragment spanning nucleotides 1895 through 4241 from PVR318 is removed and replaced with a corresponding fragment from subclone pLL3-271. This scheme is depicted in FIG. 5. Specifically, pLL3-271 is digested with SacI and Hind III and the 2.346 kb fragment spanning nucleotides 1895 through 4241 is isolated by agarose gel electrophoresis. Plasmid PVR318 is partially digested with SacI and HindIII to remove the 2.346 kb fragment between nucleotides 1895 and 4241. The remaining 10.007 kb fragment is isolated by agarose gel electrophoresis and then ligated to the SacI/HindIII fragment from pLL3-271. The resulting ligation product, pLED3, represents a true full-length type 3 poliovirus vaccine strain cDNA. The sequence of pLED3 is depicted in FIG. 6, panels A-J.

EXAMPLE 8 Transfection Of Cells With Poliovirus P3/Sabin cDNA

Primary monkey kidney cells are grown to 80% confluency in duplicate 25 cm² flasks in Eagle's basal medium (BME) containing Hank's balanced salt solution, 0.35% bicarbonate and 10% calf serum. The cultures are maintained at 37° C. The medium is then removed and the cells transfected with pLED3 (prepared as in Example 7), added as a calcium phosphate precipitate in HEPES-buffered saline. After 20 minutes at room temperature, the cells are covered with Earle's lacteal maintenance medium and incubated for 4 hours at 37° C. The medium is then removed and the cells are washed once with fresh, warm medium. Two milliliters of 15% glycerol in HEPES-buffered saline are then added to the cells and incubation is continued for 3.5 minutes at 37° C. The glycerol is then removed and the cells are washed once again with fresh medium. One of the duplicate cultures is then covered with warm medium containing 1% Noble agar (Difco, Detroit, Mich.), while the other is covered with agar-free medium. The cultures are incubated at 34° C. for 1-5 days. Plaques are visualized by staining cells with 0.01% neutral red. Medium from the liquid culture is assayed for infectious poliovirus by plaque titration on Vero cells.

EXAMPLE 9

In Vitro Transcription Of P3/Sabin cDNA

In vitro transcription is performed by the method of van der Werf et al., Proc. Natl. Acad. Sci. USA, 83, pp. 2330-34 (1986). pLED3, containing the T7 promoter-P3/Sabin construct, is linearized at an appropriate restriction site outside of the poliovirus sequences and purified by phenol extraction and ethanol precipitation. This template DNA is then added to a mixture containing 20 mM sodium phosphate, pH 7.7, 8 mM MgCl₂, 10 mM DTT, 1 mM spermidine-HCl, 50 mM NaCl and 1 mM each of dATP, dCTP, dGTP and dUTP. RNA synthesis is initiated by the addition of 10-15 units of T7 RNA polymerase/μg linearized template and allowed to continue for 30 minutes at 37° C.

Following RNA synthesis, the DNA template is digested away with Dnase I. The remaining RNA is purified by phenol extraction and ethanol precipitation in the presence of 2.5 M ammonium acetate. The purified RNA is quantified by UV spectrophotometry.

EXAMPLE 10

Transfection Of Cells With In Vitro Transcribed P3/Sabin RNA

Semi-confluent monolayers of primary monkey kidney cell are prepared as described in Example 8. The cells are transfected with the RNA prepared in Example 9 using the method of A. Vaheri et al., "Infectious Poliovirus RNA: A Sensitive Method of Assay", Virology, 27, pp. 434-36 (1965). Specifically, the cell monolayers are washed with isotonic phosphate-buffered saline (PBS). After 15 minutes, the PBS is completely removed by aspiration. The monolayer is then coated with 0.25 ml of inoculum containing RNA and 500 μg/ml DEAE-dextran (Sigma, St. Louis, Mo.; 500,000 MW) in PBS. The infected monolayers are kept undisturbed at room temperature for 15 minutes, washed once with Earle's lacteal maintenance medium and then overlaid with either fresh medium or medium containing 1% agar, as described in Example 8. After 4-5 days at 34° C., virus is detected by plaque formation (in cultures containing agar) or by cytopathic effect.

Virus is then isolated and purified. Viral RNA is isolated by the techniques described previously. The isolated RNA is sequenced and the nucleotide at position 2493 is confirmed as cytosine, the same as the original source virus.

EXAMPLE 11

Evaluation Of The Effect Of Nucleotide 2493 On The Attenuation Of A Strain 3 Poliovirus

A derivative of an attenuated strain 3 poliovirus cDNA which contains a T instead of a C at nucleotide 2493 is constructed, such as pVR318. Viruses are produced from cDNAs with either Tor C at 2493 as in Examples 8 and 10. The resulting viruses are then tested for neurovirulence in monkeys according to standard protocols.

EXAMPLE 12

Increasing The Attenuation Of A Strain 3 Poliovirus

A strain 3 poliovirus cDNA which contains a T at position 2493 is subjected to site-directed mutagenesis to convert the T to a C. For example, pLED3 is digested with PstI to remove the poliovirus coding sequence. The poliovirus cDNA contains a single, internal PstI site at nucleotide 2604. Therefore, a PstI digestion yields a 5', 2.6 kilobase viral cDNA fragment and a 3', 4.8 kb viral cDNA fragment. The 2.6 kb fragment is isolated by standard techniques and subcloned into the unique PstI site of vector M13mp18. Nucleotide 2493 is then converted from a T to a C by site-directed mutagenesis using a "T7-GEN In Vitro Mutagenesis Kit" (United States Biochemical, Cleveland, Ohio) and following the manufacturer's directions. Following mutagenesis, the 2.6 kb piece of cDNA is removed from the vector, relegated to the 3' piece of viral cDNA and the intact cDNA is cloned into pBR322. Both the mutagenized cDNA and the original cDNA are used to produce viable poliovirus as in Example 8. The resulting viruses are assayed for attenuation according to standard protocols. The mutagenized virus produced by the cDNA containing a C at nucleotide 2493 is expected to be more attenuated than the original virus.

Confirmatory sequencing of the entire cDNA sequence in plasmid pLED3 revealed an additional A (adenosine) in a region where a run of six A's is normally found in the viral genome (positions 4133-4138). The following steps describe a method to derive pLED3.2 from pLED3:

Isolation and mutagenesis of erroneous pLED3 sequence

Starting with purified pLED3 plasmid DNA, digest the DNA to completion with restriction enzymes SacI and HindIII. By a method of choice (i.e., gel electrophoresis, HPLC) purify the 2,346 base pair SacI/HindIII fragment representing cDNA nucleotides 1895-4241.

Subclone the purified restriction fragment into bacteriophage M13 to enable oligo-directed deletion mutagenesis. An DNA oligonucleotide spanning nucleotides 4121-4150 is synthesized for this purpose. The sequence of this oligomer is: 5'-CGCCTCAGTAAATTTTTTCAACCAACTATC-3'.

Mutagenesis is performed using a "T7-GEN In Vitro Mutagenesis Kit" (United States Biochemical, Cleveland, Ohio) and following the manufacture's directions. The mutagenized insert is confirmed to possess 6 instead of 7 A's at positions 4133-4138 by sequence analysis. As above, the altered SacI/HindIII fragment is gel purified in preparation for ligation into the plasmid body lacking this restriction fragment.

Preparation of pLED3.2 from pLED3 plasmid

Starting with purified pLED3 plasmid DNA, partially digest the DNA with restriction enzymes SacI and HindIII. Purify the 10,007 base pair SacI/HindIII fragment corresponding the plasmid minus the 2,346 base pair SacI/HindIII fragment discussed above.

Construction of pLED3.2:

The purified plasmid body and mutagenized 2,346 base pair fragment are ligated together. Competent E. coli DH5α cells are transformed with the ligation products and tetracycline resistant colonies are selected. Identity of pLED3.2 is based on the confirmation of 6 A's at 4133-4138.

LED3 and VR318 viruses used in monkey NV test

The full-length cDNA in pLED3 (uncorrected) was infectious even though the additional A discussed above predicts a shift in the reading frame for viral proteins 2C, 3A, 3B, 3C and 3D. In vitro transcription of pLED3 cDNA using T7 RNA polymerase produced RNAs which possessed the erroneous seven A's as determined by direct sequence analysis. When these transcripts were used to transfect monkey kidney cells, virus was recovered which possessed the correct number of A's at this site.

Although the cDNAs in pLED3 and pVR318 differ by nucleotide composition at 2493 and by the number of A's between 4130-4136, the 2493 mutation was the only difference found to distinguish the viruses generated using these cDNAs. These cDNA-derived viruses were used to carry out the studies to assess the significance of 2493 mutation (described in the manuscript currently under review).

Polymerases are known to have problems copying from regions in which a single nucleotide is repeated several times. We proposed that T7 RNA polymerase may have generated by error some RNAs which contained six instead of seven A's while transcribing the run of seven A's from pLED3. Only those transcripts possessing six A's could produce virus upon transfection of cells. This proposed explanation seemed highly probable to a scientist who works on T-phage polymerase.

Recombinant DNA prepared by the processes of this invention is exemplified by a sample deposited with the American Type Culture Collection located at 12301 Parklawn Drive, Rockville Md. 20852 under the provisions of the Budapest Treaty on the International Recognition of the Deposit of Microorganisms for the Purposes of Patent Procedures. This sample was deposited on Apr. 17, 1990, identified as a plasmid containing the infectious full length copy of vaccine strain Sabin 3, and designated pLED3. This deposit was assigned ATCC accession number 40789.

EXAMPLE 13

Experimental Methods

Sabin 3-Specific Mutation Within the N-Terminal Region of VP1 is Attenuating

Preparation of cDNA clones

Viral RNA was extracted from Sabin 3 vaccine virus (Lederle-Praxis Biologicals, N.Y.) after pelleting and SDS disruption as described before (20). A cDNA library was made from the viral RNA template using reverse transcriptase and a procedure described elsewhere (12). Oligonucleotide-directed mutagenesis was performed on DNAs subcloned in M13 grown in E. coli CJ236, to change the cDNA sequence at single base positions (16). For this purpose, antisense oligonucleotides (17-mers) were synthesized with the target nucleotide positioned in the center of the sequence. Common restriction enzyme cleavage sites EcoRI, HindIII, SacI, SmaI) were used to construct full-length cDNA clones. The full-length cDNAs (7,459 bp) were constructed in vector pBR322, selected and amplified using E. coli DH5α (Bethesda Research Laboratories).

Cells

Vero cells were propagated as monolayers in Eagle's MEM with Earle's salts supplemented with 0.11% bicarbonate and 10% fetal bovine serum. Primary African green monkey kidney cells (PCMK) were initiated in BME with Hank's salts containing 0.035% bicarbonate and 10% serum. At 70% confluence, the cells were refed with BME (Earle's salts) containing 5% serum. During poliovirus infection, all cultures were maintained in modified Earle's lactalbumin hydrolyzate maintenance medium (LMM; pH 7.34 without serum.

Viruses

The Sabin 3 vaccine virus (RSO+2; Lederle-Praxis Biologicals, Pearl River, N.Y.) was prepared by a single passage of the rederived (RSO+1) manufacturing seed in PCMK cells. For experimental purposes, the same seed was used to produce RSO+2 vaccine virus in Vero cells. Virus NCl represents vaccine (SO+2) produced in PCMK cells from the original (SO+1) manufacturing seed. The cDNA-derived viruses recovered from Vero cells transfected with T7 RNA polymerase transcripts of pLED3 or pVR318 were designated LED3+1 and VR318+1. These "seed" viruses were amplified once more in Vero cells at a multiplicity of infection (MOI) of 0.1 at 33.5°±0.5° to create virus stocks LED3+2 and VR318+2. Particular attention was paid to preparing the LED3+2 and VR318+2 virus samples in a manner as similar as possible to actual vaccine (ie. passage level and culture conditions). The characterization studies were carried out using these final virus stocks.

Determination of Virus Titer

The titer of infectious virus in samples was determined most often by microtitration on HEp-2 cells and expressed as tissue culture infectious dose (TCID)₅₀ per ml (1). In some cases, infectious titer was measured by plaque titration on Vero cell monolayers and therefore expressed as plaque-forming units (pfu) per ml. Serial ten-fold dilutions of virus prepared in LMM were used to inoculate 25 cm² confluent monolayers and allowed to absorb at 22° C. The cells were overlaid with 1.0% Noble agar (Difco Laboratories) in MEM (Hank's) plus 2% fetal calf serum then incubated at 33.5°±0.5° C. After 3 days, plaques were visualized and counted after staining the cells with neutral red (0.01% solution). For a given sample, there is routinely a 0.6 log difference in absolute numbers determined using the two methods described above; the TCID₅₀ value is always greater.

Nucleotide Sequence Determination

RNA sequence was determined using synthetic oligonucleotides and the dideoxynucleotide chain termination method as described before (20). Sequencing of cloned cDNAs was performed using Sequenase DNA Sequencing Kit (U.S. Biochemical).

In Vitro RNA Synthesis and Transfection

DNA templates were prepared by digesting the plasmid DNA completely with PvuI restriction enzyme followed by extraction with phenol and ethanol precipitation. Transcription reaction mixtures containing 1 μg of DNA template were prepared as described by Moss et al. (12). RNA synthesis was initiated by addition of 30 units purified T7 RNA polymerase (Pharmacia) and the reactions were incubated at 37° C. for 90 mins. The DNA template and full-length transcription product were quantitated by comparing the intensity of appropriately-sized bands to known amounts of a standard after electrophoresis in agarose gels stained with ethidium bromide. The 9.0 kb HindIII fragment of bacteriophage lambda DNA was used as the standard for template; a 7.5 kb single-stranded RNA marker (Bethesda Research Laboratories) was used for transcript. Numerical values were obtained by performing densitometry on a photographic negative taken of the gel. Aliquots of the transcription reaction mixture containing 25 μg of full-length transcript were used to transfect Vero cell monolayers (25 cm2) according to the procedure described by van der Werf et al. (23). The cDNA-derived viruses were harvested when the cell monolayer was completely destroyed (+4 CPE).

Isotopic Labeling of Virus and PAGE

Vero cells (6.5×10⁶) were infected at an MOI of 16 pfu/cell. The virus was allowed to absorb for 1 hour, the cells were then washed twice and covered with LMM before incubation at 33.5° C. Four and a half hours later, the medium was changed to methionine-free MEM (Select Amine Kit, GIBCO). After one hour, the cells were replenished with fresh medium supplemented with ³⁵ S! methionine (specific activity, >1000 Ci/mmol; Amersham) to a final concentration of 60 μCi/ml and incubation continued. At+4 CPE (within 24 hr), the medium containing virus was clarified, first by low-speed centrifugation (2500 rpm, 20 mins, 4° C.) and then by microfiltration (0.22 μm Millex-GV; Millipore). Virus was pelleted by ultracentrifugation using a Beckman 70.1Ti rotor (70K, 1 h, 4° C.), and then resuspended and boiled in Laemmli sample buffer (10). The samples were subjected to SDS-polyacrylamide gel electrophoresis and the proteins visualized by autoradiography.

Virus Thermostabiltiy Curves

Multiple 3.0 ml samples of virus (approx. 10⁷.3 pfu/ml in LMM) were incubated at room temperature (22° C.) or in water baths at 37° C. or 42° C. At 24 hour intervals over the course of 5 days, one sample from each incubation set was removed and frozen at -20° C. Once all the sample incubations were completed, the virus titer in each sample was measured by plaque titration on Vero monolayers.

Virus Growth Curves

Vero cell monolayers (10⁷ cells/25 cm² flask) were infected with 4 pfu/cell and maintained as described above. At the indicated times post-infection, the medium from individual cultures was harvested then stored at 70° C.

Neurovirulence Test

Virus samples were tested for neurovirulence in Macca mulatto monkeys using two accepted procedures. As described in the United States Code of Federal Regulations (22), 0.2 ml or 0.5 ml of virus sample containing at least 10⁷.6 TCID₅₀ /ml is injected into the spinal cord (IS) or the thalamic region of each brain hemisphere (IT) of a monkey, for each test respectively. The WHO test (26) involves intraspinal injection of monkeys with 0.1 ml of virus (titer of 10⁶.5 -10⁷.5 TCID₅₀ /ml). After injection, the test animals are observed for 17-21 days for clinical signs of poliomyelitis. The animals are then sacrificed to allow histological examination of the brain and spinal cord for poliovirus lesions. The nervous tissue from each monkey is evaluated using a scoring method from 1 (low) to 4 (high) to reflect the severity of neuronal damage observed. The mean lesion score is an average of the scores recorded for monkeys within a group.

Statistical Analysis

Differences in mean lesion scores were analyzed by Analysis of Variance (ANOVA) model with Least Squares determinations employed for mean range testing. Frequencies of reactivity in the monkeys injected intrathalamically were tested for significance using Chi-square test.

Results

Construction of an authentic LED3 cDNA

Screening of the cDNA library through restriction analysis and nucleotide sequencing, identified four cDNA clones which represented all but the 5' six nucleotides of the Sabin 3 genome. The 5'-most cDNA clone was modified so that first six nucleotides of Sabin 3 cDNA were reconstructed and positioned under the control of bacteriophage T7 promoter by incorporation of a synthetic DNA oligonucleotide 5'-CTGCAGTAATACGACTCACTATAGGTTAAAACAGCTCTGGGGTTTG-3'! using the polymerase chain reaction. The orientation of the promoter ensures the synthesis of positive-sense RNA transcripts of the cloned cDNA. When the complete nucleotide sequence of the four cDNA subclones was compared to the LED3 RNA sequence (24) four nucleotide differences were identified. It is unclear whether the nucleotide differences represent errors made during reverse transcription or sequence divergence of a minor population within the vaccine virus sample. However, the full-length uncorrected cDNA was not infectious since virus nor CPE resulted from transfection of cells with T7-derived transcripts made of the cDNA. The sequence was corrected using oligonucleotide-directed mutagenesis (see Materials and Methods, Example 13) at three of these positions: 198 (G to A), 4466 (T to C) and 6334 (T to C). The last error, U instead of C at position 2493, fortuitously represented the Leon-like nucleotide at this position and was left untouched in the full-length cDNA in pVR318. This clone differs from LED3 RNA sequence only at n2493. The final step in the production of an authentic LED3 cDNA (pLED3.2) was accomplished by cDNA fragment exchange (SacI at n1895 to HindIII at n4241) between pVR318 and a vaccine cDNA subclone found to possess C at position 2493.

The full-length LED3 cDNA construct in cloning vector pBR322 is diagrammed in FIG. 7, panel A. Nucleotide sequence analysis across the junction between the 3' terminus of the Sabin 3 cDNA and plasmid pBR322 indicated a poly-A tail consisting of 27 A's. A string of 15 C's derived from cDNA cloning steps immediately follows the poly-A tail.

In vitro transcription of the cDNAs in pLED3 and pVR318

The plasmid DNAs were digested with PvuI to 1) generate a linear DNA template containing the entire poliovirus cDNA for transcription by T7 RNA polymerase and to 2) create a template with the shortest span of extraneous vector sequence after the 3' end of the cDNA. PvuI corresponds to the first restriction site after the 3' end of the cDNA (exactly 126 bp) which is not present within the cDNA. Two PvuI fragments (4.4 kb and 8.1 kb) are produced from pLED3 and pVR318, but only the 8.1 kb fragment contains the T7 promoter based on sequence analysis. (refer to FIG. 7, A). Panel B of FIG. 7 demonstrates that RNAs are efficiently synthesized by T7 RNA polymerase from the PvuI-restricted pLED3 and pVR318 and their size matches that predicted for runoff transcripts from the 8.1 kb fragment. In lanes 5 and 6, the two upper bands (low intensity) correspond to the 8.1 kb and 4.4 fragments produced by digestion of the plasmid DNAs with PvuI. Lane 4 contains HindIII/lambda DNA for a ds DNA size reference. The transcription product (most intense band in lanes 5 & 6) comigrates neatly with the 7.5 kb ssRNA marker (lanes 1-3) which is consistent for full-length (7585 nucleotides) RNAs. In comparison to virion RNA (lane 7), the in vitro transcripts of the poliovirus cDNA migrate slightly faster, perhaps due to the absence of covalently attached VPg protein. A transcription reaction containing 1 μl of template reproducibly produced 20-25 μg of full-length transcript. As shown here, the band corresponding to in 1 μg of a 50 μl reaction mixture (lane 5 or 6) is approximately the same intensity as the band representing 500 ng of 7.5 kb ssRNA (lane 2). Direct sequence analysis of these transcripts determined that there are two extra guanines at the 5' end immediately before the first poliovirus nucleotide and confirmed the presence of 126 bp of extraneous pBR322 sequence at the 3' end after the poly-A, poly-C tail.

When Vero cells were transfected with the RNAs described above, cytopathic effects consistent with poliovirus infection were observed in 24 hours and virus were harvested (+4 CPE) within 48-72 hours. The specific infectivity of the transcripts was found to be 1-2×10² pfu/μg, about 3% that of vaccine RNA. As determined by sequencing, the RNA from the recovered viruses lacks the extraneous pBR322 sequence at the 3' end. In addition, the genomes of these viruses were verified to possess the attenuated nucleotides at positions 472(U), 2034(U), and 6061(U); nucleotide 2493 was the only known difference between LED3 (2493-C) and VR318 (2493-U).

Mutation at n2493 correlates with altered VP1 mobility

The mutation identified at position 2493 in the consensus genome of Sabin 3 vaccine virus predicts a Ile-6→Thr substitution in VP1 (24). To determine if the biochemical properties of VP1 would be altered by this amino acid substitution, the UPI proteins of LED3 altered by this amino acid substitution, the VP1 proteins of LED3 and VR318 were compared. The VP1 from LED3 contains threonine at residue 6, whereas VR318 has isoleucine at this site. The difference in molecular weights between these amino acids is minimal and yet the VP1 proteins from these viruses are distinguishable by SDS-PAGE (FIG. 8). A possible explanation for this observed difference in VP1 migration derives from the fact that the Thr side chain has a hydroxyl group that can form a hydrogen bond, whereas the Ile side chain is hydrophobic. As a result, the VP1 of VR318 virus with Ile-6 may bind more SDS and therefore migrate faster than the VP1 of LED3. These data associate at least one biophysical change with the 2493 mutation. Whether this alteration in VP1 impacts the 3D structure of the virion is under evaluation.

Thermostability of LED3 and VR318 viruses

In the three-dimensional structure of poliovirus, the N-terminal region of VP1 is buried on the inside of native virions in close association with terminal regions of the other capsid proteins (6). In attempt to assess whether the 2493 mutation alters virion stability, LED3 and VR318 viruses were compared for susceptibility to thermal inactivation at several temperatures. As illustrated in FIG. 9, there was no loss in titer observed for either virus sample over a five day period at room temperature (22° C.). At 37° and 42° C., the titers of both virus samples decreased similarly. Despite heat treatment, the difference in plaque morphology between LED3 and VR318 viruses were preserved (see below).

Effect of n2493 on phenotypic markers

Small plaque size is often used to differentiate attenuated vaccine strains from virulent strains (13). In conducting a simple plaque titration of the cDNA-derived virus samples on Vero cells monolayers, it became apparent that the VR318 virus produces plaques that are obviously larger than those of LED3 virus. When the comparison included the pathogenic parent Leon strain, it was clear that VR318 plaques are of intermediate size (see FIG. 10). These data suggest that the single nucleotide difference at position 2493 between LED3 (C) and VR318 (U) is responsible for the increase in plaque size displayed by VR318 compared to LED3.

The traditional temperature-sensitivity (rct/40° C.) and "d" marker phenotypes of LED3 and VR318 viruses were also evaluated. Although these viruses were not distinguishable by either of these tests, both exhibited the attenuated phenotype compared to Leon virus (data not shown).

Growth curves of LED3 and VR318

To compare LED3 and VR318 replication, the growth kinetics of these viruses were compared in Vero cells under conditions known to be permissive for attenuated poliovirus (i.e, low temperature, high pH). FIG. 11 demonstrates there is no dramatic difference between the kinetics of virus release from Vero cells infected with LED3 and VR318. The titers achieved at 24 hours post-infection (10⁸ pfu/ml) indicated that neither of these viruses is severely debilitated under these conditions. A consistent, although subtle, difference was observed in the fold increase in titer between 8 and 12 hours post-infection suggesting that VR318 may have the ability to replicate faster than LED3. During this time period, the titer of VR318 increased 200-fold compared to only a 30-fold increase for LED3. Whether modification of culture conditions (i.e., lower pH) exaggerates the observed difference between LED3 and VR318 growth kinetics is under evaluation.

Neurovirulence of LED3 and VR318 in monkeys

Through the construction and neurovirulence testing of recombinant viruses derived from full-length Sabin 3 and Leon cDNAs, Westrop et a. (25) have correlated the attenuated phenotype of Sabin 3 with the point mutations at positions 472 and 2034. Applicants' identification of the Sabin 3-specific point mutation at 2493 (24), raised the question of whether this mutation might also be a determinant of attenuation.

The cDNA-derived viruses, LED3 and VR318, were compared to appropriate controls for neurovirulence as tested in monkeys by procedures contained in either the WHO or United States CFR requirements for the acceptance of vaccine lots (see Materials and Methods, Example 13). Differences between these procedures include the route of inoculation (intrathalamic & intraspinal versus only intraspinal) as well as amount (volume and titer) of the sample injected.

Table 1A lists the neurovirulence data from the CFR intraspinal (IS) test in which LED3 and VR318 were tested concurrently and compared to test results of actual vaccine (RSO+2) produced on primary monkey kidney cells (PCMK). Testing of RSO+2 (Vero) vaccine demonstrated that the use of Vero cells to produce vaccine virus as described had no effect on attenuation. The mean lesion scores produced by RSO+2 (PCMK), RSO+2 (Vero) and LED3+2 (Vero) were 0.52, 0.36 and 0.34, respectively. These data demonstrate that LED3 is no more neurovirulent than current vaccine virus. Interestingly, the mean lesion score of monkeys receiving VR318 was 1.31 which was significantly higher (p<0.01) than the scores produced by the other viruses. These data indicate that VR318 virus is not equivalent to current RSO+2 vaccine and that presence of C (LEd3 and RSO+2) instead of U (VR318) at nucleotide position 2493 is attenuating.

As above, the neurovirulence of LED3 and VR318 after intrathalamic route of injection into monkeys was compared to data obtained from four complete tests of current RSO+2 vaccine (Table 1B). Since brain tissue is less susceptible than spinal tissue to poliovirus infection, neurovirulence using this procedure is based essentially on whether any lesions are visible and in what percentage of the monkeys rather than a lesion score. As demonstrated by actual RSO+2 vaccine, a low level of reactivity in a group of monkeys (4.0%) is typical and desirable. Of the 10 monkeys receiving LED3, none exhibited lesions. VR318, however, produced lesions in 2 of 10 (20%) test animals. Although a group of 30 monkeys are required for a complete IT test, the increased percentage of positive monkeys in the VR318 group is highly unusual and predicts that VR318 would fail CFR IT data indicate that when compared to current RSO+2 vaccine, LED3 virus is equivalent and VR318 is more neurovirulent.

Using the WHO test procedure, LED3 and VR318 were evaluated concurrently with virus NCl, which is equivalent to the attenuated type 3 WHO test reference. As listed in Table 2, the mean lesion scores for LED3 and VR318 were 0.21 and 1.51, respectively. Confirmed by three different methods, the demonstration that LED3 is more attenuated than VR318 is unequivocal. The interpretation of the WHO test data is made somewhat difficult however by the performance of the attenuated reference, NCl. In this test, NCl produced a mean lesion score of 1.08, which falls between the values calculated for LED3 and VR318. Although a mean score of 1.08 is high for this test reference, the comparison of reactivity between NCl, LED3 and VR318 is valid since they were tested concurrently. A statistical comparison of the resultant scores points to the fact that by this test procedure, VR318 cannot be distinguished from the attenuated reference. Based on this preliminary data, it is unclear whether VR318 would fail in a full WHO test involving 24 monkeys per group. On the other hand, the lesion score associated with LED3 was shown to be significantly lower than either VR318 or the NCl reference (p<0.01).

Interestingly, the NCl reference virus represents vaccine material manufactured using Sabin original (SO), not the rederived Sabin original (RSO) seed. Nucleotide sequence determination at position 2493 of SO+2 vaccine (Lederle; same as NCl) revealed a 1:1 mixture of C and the variant U. Similar evaluation of current RSO+2 vaccine (Lederle) demonstrated only C at this position supporting the fact that the RSO seed is a purified derivative of the SO strain (24). Since test samples LED3, VR318 and NCl did not differ in nucleotide composition at position 472 based on RNA sequence determination, the increased level of neurovirulence exhibited by NCl compared to LED3 using the WHO procedure likely derives from the subpopulation of 2493-U variants in the NCl pool of virus.

The identification of a new Sabin 3-specific mutation at position 2493 which encodes an isoleucine to threonine change at the sixth amino acid of VP1 (24) is shown here to be a determinant of attenuation. The attenuation of Sabin 3 poliovirus has been correlated to point mutations at positions 472 and 2034 by others previously (25). To assess the contribution of the 2493 mutation, a virus was produced using a completely verified vaccine cDNA (LED3) and compared it to a derivative, VR318, which was the same except that it possessed the Leon-like nucleotide (U) instead of C at this position. The additional mutation in LED3 correlated to smaller plaque size as well as decreased neurovirulence in monkeys.

The data presented herein demonstrate that the biological properties associated with attenuation of the Sabin 3 vaccine strain were preserved in LED3. There are many benefits associated with the use of a Sabin 3 cDNA-derived seed strain. The stock volumes of both the original and rederived Sabin seeds, SO and RSO respectively, are limited due to the fact that they are virus plaque isolates. When seeds are stored in the form of a cloned, genetically defined cDNA, the limitations on seed supply are removed; in addition, such seeds can be preserved indefinitely. Without restriction on seed supply there is increased flexibility in the multiplicity of infection (MOI) that can be used to produce vaccine. Weeks-Levy et al. (24) showed that virus samples produced using accepted manufacturing seeds at higher MOI's are genetically more homogeneous. Since RNA viruses by their nature generate variants at height frequency than DNA viruses, an RNA virus seed established in the form of a cloned, genetically defined DNA should be the most homogeneous. By this approach, the amount of undesirable variants that could be selectively amplified during passage have been minimized which translates to increased genetic stability. Passage studies to assess whether LED3 constitutes a more genetically stable see as compared to other manufacturing seeds for Sabin 3 vaccine is currently under evaluation.

Based on nucleotide sequence determination, Sabin 3 variants possessing U at 2493 during passage in vitro were shown to accumulate more rapidly than variants possessing C at 472 (24). In the same study, the Sabin 3 component of different OPVs was found to vary greatly in the proportion of C and U at 2493. Although the data presented here do not address how the proportion of 2493-U variant affects the acceptability of vaccine lots, data comparing viruses LED3 (2493-C) and VR318 (2493-U) suggest that the result will depend on neurovirulence test method used for the evaluation. Of particular interest was the ability of the CFR intrathalamic test method to distinguish LED3 and VR318. The unusually high reactivity of VR318 in the brain compared to LED3 or RSO+2 vaccine suggests that the interaction between virus and brain tissue is enhanced when virus possesses U at 2493. Since the WhO test method does not evaluate vaccine by intrathalamic route of injection, our data suggest that this test would be less likely to detect 2493-U virus subpopulation in vaccine lots.

A detection method incorporating the polymerase chain reaction was used recently by Chumakov et al. (2) to determine that vaccine lots containing 472-C variants comprising greater than 1.17% of total virus failed the WHO neurovirulence test. The determination of equivalence at position 472 for LED3 and VR318 was based on sequence analysis of the viral RNA. It was determined that a 10% variant subpopulation is the limit of detection using this method (20). It is possible that the more sensitive PCR method would detect 472-C subpopulation in both LED3 and VR318 virus preparations. It is however unlikely that these virus samples would differ in proportion of 472-C variant because LED3 and VR318 are equivalent passage levels from the cloned cDNA and were generated under identical conditions. A preliminary evaluation of LED3 and VR318 using a PCR method to detect 472-C variants shows that these samples cannot be distinguished.

There are several lines of data supporting the selective advantage for Sabin 3 variants possessing U at 2493 in vivo, as well as in vitro. Stool isolate KW4, recovered 5 days post-vaccination, was shown previously to differ from the administered vaccine strain at three positions: 472=C, 2493=U, and 6061=U/C (20). Based on the data presented here, the intermediate level of neurovirulence observed for KW4 in that study can now be attributed to the mutation at 2493 as well as 472. Weeks-Levy et al. (24) found that two of three virus isolates recovered from nervous tissue (brain or spinal cord) of monkeys that had been injected intraspinally with NCl, the attenuated reference virus, possessed U at 2493. That isolate NCl-679B had U at 2493 without loss of the attenuated U at 472 may specifically relate to how this virus spread and replicated in the brain. The data are consistent with observations of increased neurovirulence for VR318 compared to LED3 as tested by CFR intrathalamic method.

The mechanism by which the change from U to C at 2493 attenuates LED3 compared VR318 is unclear. This mutation alters the sixth amino acid capsid protein VP1. In the three-dimensional structure of poliovirus, Hogle et al. (6) show that this region of VP1 is buried on the inside of the native virion. More recently, Fricks and Hogle (5) demonstrated that upon attachment to susceptible cells, the virion undergoes conformational changes resulting in release of capsid protein VP4 and externalization of the amino terminus of VP1. These authors demonstrated further that exposure of the amino terminus of VP1 was required for attachment to liposomes and proposed that these events play a role in the mechanism of cell entry. Consistent with these observations, Kirkegaard (8) described two poliovirus mutants with different small deletions in the amino terminal region of VP1 which flank either side of residue six as defective in the physical release of viral RNA from the capsid during normal infection. Both of these deletion mutants exhibited small plaque phenotype. From these data, it is easy to speculate that the mutation is Sabin 3 at position 2493 also affects viral uncoating.

In addition to the mutation in VP3 (2034) of Sabin 3, attenuation determinants have been mapped to the capsid proteins in Sabin 1 (14) and a type 2 strain (P2/712) which is closely related to Sabin 2 (16). Although these other mutations occur within capsid protein VP1, the structural mutation described in this study is the first one to be mapped to the amino terminal region of VP1.

While a number of embodiments of this invention have been described hereinabove, it is apparent that the basic constructions can be altered to provide other embodiments which utilize the processes, recombinant DNA molecules, cDNA molecules and transformed hosts of this invention. Therefore, it will be appreciated that the scope of this invention is to be defined by the claims appended hereto rather than by specific embodiments which have been presented hereinbefore by way of example.

                                      TABLE 1                                      __________________________________________________________________________     NEUROVIRULENCE OF LED3 AND VR318 STRAINS                                       USING CFR NV TEST PROCEDURE                                                                CELL   NUCLEOTIDE AT                                                                            NO. OF                                                                               MEAN LESION                                 GROUP                                                                              VIRUS   SUBSTRATE                                                                             472  2493 MONKEYS                                                                              SCORE                                       __________________________________________________________________________     A) INTRASPINAL.sup.a                                                           1   RSO+2   PCMK   U    C    24    0.52                                        2   RSO+2   VERO   U    C    12    0.36                                        3   LED3+2  VERO   U    C    16    0.34                                        4   VR318+2 VERO   U    U    16    1.31.sup.b                                  __________________________________________________________________________                 CELL   NUCLEOTIDE AT                                                                            NO. OF                                                                               PERCENT                                     GROUP                                                                              VIRUS   SUBSTRATE                                                                             472  2493 MONKEYS                                                                              POSITIVE                                    __________________________________________________________________________     B) INTRATHALAMIC.sup.c                                                         5   RSO+2   PCMK   U    C    120   4                                           6   LED3+2  VERO   U    C    10    0                                           7   VR318+2 VERO   U    U    10    20.sup.d                                    __________________________________________________________________________      .sup.a 0.2 ml of virus (titer ≧ 7.6 log TCID.sub.50 /ml)                administered intraspinally.                                                    .sup.b Group 4 > 1,2,3 (p < 0.01) by ANOVA and mean range testing.             .sup.c 0.5 ml of virus (titer ≧ 7.6 log TCID.sub.50 /ml)                administered intracerebrally into the thalamic region of each hemisphere.      .sup.d Group 7 > 5,6 (p < 0.05) by Chisquare test.                       

                                      TABLE 2                                      __________________________________________________________________________     NEUROVIRULENCE OF LED3 AND VR318 STRAINS                                       USING WHO NV TEST PROCEDURE.sup.a                                                          CELL   NUCLEOTIDE AT                                                                            NO. OF                                                                               MEAN LESION                                 GROUP                                                                              VIRUS   SUBSTRATE                                                                             472  2493 MONKEYS                                                                              SCORE                                       __________________________________________________________________________     1   LED3+2  VERO   U    C    6     0.21.sup.b                                  2   NC1.sup.c                                                                              PCMK   U    U/C  6     1.08                                        3   VR318+2 VERO   U    U    6     1.51                                        __________________________________________________________________________      .sup.a 0.1 ml of virus (titer 6.5 to 7.5 log TCID.sub.50 /ml) adminstered      intraspinally.                                                                 .sup.b Group 2 < 1,3 (p < 0.1) by ANOVA and mean range testing.                .sup.c NC1 (SO+2) is an attenuated type 3 reference for the WHO NV test. 

REFERENCES

1. Albrecht, P., J. C. Enterline, E. J. Boone, and M. J. Klutch. 1983. Polioviurs and polio anitbody assay in HEp-2 and Vero cell cultures. J. Biol. Stand. 11:91-97.

2. Chumakov, K. M., L. B. Powers, K. E. Noonan, I. B. Ronison and I. S. Levenbook. 1991. Correlation between amount of virus with altered nucleotide sequence and the monkey test for acceptability of oral poliovirus vaccine. Proc. Natl. Acad. Sci. U.S.A. 88:199-203.

3. Domingo, E. 1989. RNA virus evolution and the control of viral disease. Prog. Drug Res. 33;93-133.

4. Dunn, G., N. T. Begg, N. Cammack, and P. D. Minor. 1990. Virus exretion and mutation by infants following primary vaccination with live oral poliovaccine from two sources. J. Med. Virol. 32:92-95.

5. Fricks, C. E. and J. M. Hogle. 1990 Cell-induced conformational change in poliovirus: Externalization of the amino terminus of VP1 is responsible for liposome binding. J. Virol. 64:1934-1945.

6. Hogle, J. M., M. Chow, and D. J. Filman. 1985. Three-dimensional structure of poliovirus at 2.9A resolution. Science 229:1358-1365.

7. Kew, O. M., B. K. Nottay, M. H. Hatch, J. H. Nakano and J. F. Obijeski. 1981. Multiple changes can occur in the oral polio vaccines upon relication in humans. J. Gen. Virol. 56:337-347.

8. Kirkegaard, K. 1990. Mutations in VP1 of poliovirus specifically affect both encapsidation and release of viral RNA. J. Virol. 64:195-206.

9. Kohara, M., A. Shinobu, S. Kuge, B. L. Semler, T. Komatsu, M. Arita, H. Itoh and A. Nomoto. 1985. An infectious cDNA clone of the poliovirus Sabin strain could be used as a stable repository and inoculum for the oral polio live vaccine. Virology 151:21-30.

10. Laemmli, U. K. 1970. Cleavage of structural proteins during the assembly of the head of bacteriophage T4. Nature (London) 227:680-685.

11. Melnick, J. L., M. Benyesh-Melnick, and J. C. Brennan. 1959. Studies on live poliovirus vaccine. JAMA. 171:63-70.

12. Moss, E. G., R. E. O'Neill, and V. R. Racaniello. 1989. Mapping of attenuating sequences of an avirulent poliovirus type 2 strain. J. Virol. 63:1884-1890.

13. Nakano, J. H., M. H. Hatch, M. L. Thieme and B. Nottay. 1978. Parameters for differentiating vaccine-derived and wild poliovirus strains. Prog. Med. Virol. 24:178-206.

14. Nomoto, A. and E. Wimmer. 1987. Genetic studies of the antigenicity and the attenuation phenotype of poliovirus, p. 107-134. In W. C. Russell and J. W. Almond (ed.), Molecular basis of virus disease. Cambridge University Press, Cambridge.

15. Racaniello, V. R. 1988. Poliovirus neurovirulence. Adv. Virus Res. 34:217-246.

16. Ren, R., E. G. Moss, and V. R. Racaniello. 1991. Indentification of two determinants that attenuate vaccine-related type 2 poliovirus. J. Virol. 65:1377-1382.

17. Stanway, G., A. J. Cann, R. Hauptmann, P. Hughes, L. D. Clarke, R. C. Mountford, P. D. Minor, G. C. Schild, and J. W. Almond. 1983. The nucleotide sequence of poliovirus type 3 leon 12 a₁ b: comparison with poliovirus type 1. Nucleic Acids Res. 11:5629-5643.

18. Stanway, G., P. J. Hughes, R. C. Mountford, P. Reeve, P. D. Minor, G. C. Schild, and J. W. Almond. 1984. Comparison of complete nucleotide sequences of the genomes of the neurovirulent poliovirus P3/Leon/37 and its attenuated Sabin vaccine derivative P3/Leon12a₁ b. Proc. Natl. Acad. Sci. U.S.A. 81:1539-1543.

19. Stones, P. B., C. R. Macdonald, J. K. McDougall and P. F. Ramsbottom. 1964. Preparation and properties of a derivative of Sabin's type 3 poliovirus strain Leon 12a₁ b. 10th Symposium of the European Association against Poliomyelitis, Warsaw, pp. 390-397.

20. Tatem, J. M., C. Weeks-Levy, S. J. Mento, S. J. MiMichele, A. Georgiu, W. F. Waterfiled, B. Sheip, C. Costalas, T. Davies, M. B. Ritchey and F. R. Cano. J. Med. Virol., in press.

21. Toyoda, H., Kohara, M., Kataoka, Y., Suganuma, T., Omata, T., Imura, N., and Nomoto, A. 1984. Complete nucleotide sequences of all three poliovirus serotype genomes: Implication for gentic relationship, gene function and antigenic determinants. J. Mol. Biol. 174:561:585.

22. United States Code of Federal Regulations. 1990. Poliouvirus vaccine live oral. Title 21, Sec. 630.10-17.

23. van der Werf, S., J. Bradley, E. Wimmer, F. W. Studier and J. J. Dunn. 1986. Sythesis of infectious poliovirus RNA by purified T7 RNA polymerase. Proc. Natl. Acad. Sci. U.S.A. 83:2330-2334.

24. Weeks-Levy, C., J. M. Tatem, S. J. DiMichele, W. Waterfield, A. F. Georigu and S. J. Mento. Submitted for publication.

25. Westrop, G. D., K. A. Wareham, D. M. A. Evans, G. Dunn, P. D. Minor, D. I. Magrath, F. Taffs, S. Marsden, M. A. Skinner, G. C. Schild and J. W. Almond. 1989. Genetic basis of attenuation of the Sabin type 3 oral poliovirus vaccine. J. Virol. 63:1338-1344.

26. World Health Organization. 1990. Requirements for poliomyelitis vaccine (oral). W. H. O. Tech. Rep. Ser. 800:30-36.

    __________________________________________________________________________     SEQUENCE LISTING                                                               (1) GENERAL INFORMATION:                                                       (iii) NUMBER OF SEQUENCES: 9                                                   (2) INFORMATION FOR SEQ ID NO:1:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 7432 base pairs                                                    (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (ix) FEATURE:                                                                  (A) NAME/KEY: CDS                                                              (B) LOCATION: 743..7361                                                        (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:                                        TTAAAACAGCTCTGGGGTTGTTCCCACCCCAGAGGCCCACGTGGCGGCTAGTACACTGGT60                 ATCACGGTACCTTTGTACGCCTGTTTTATACTCCCTCCCCCGCAACTTAGAAGCATACAA120                TTCAAGCTCAATAGGAGGGGGTGCAAGCCAGCGCCTCCGTGGGCAAGCACTACTGTTTCC180                CCGGTGAGGCCGCATAGACTGTTCCCACGGTTGAAAGTGTCCGATCCGTTATCCGCTCAT240                GTACTTCGAGAAGCCTAGTATCGCTCTGGAATCTTCGACGCGTTGCGCTCAGCACTCAAC300                CCCGGAGTGTAGCTTGGGCCGATGAGTCTGGACAGTCCCCACTGGCGACAGTGGTCCAGG360                CTGCGCTGGCGGCCCACCTGTGGCCCAAAGCCACGGGACGCTAGTTGTGAACAGGGTGTG420                AAGAGCCTATTGAGCTACATGAGAGTCCTCCGGCCCCTGAATGCGGCTAATTCTAACCAT480                GGAGCAGGCAGCTGCAACCCAGCAGCCAGCCTGTCGTAACGCGCAAGTCCGTGGCGGAAC540                CGACTACTTTGGGTGTCCGTGTTTCCTTTTATTCTTGAATGGCTGCTTATGGTGACAATC600                ATAGATTGTTATCATAAAGCGAGTTGGATTGGCCATCCAGTGTGAATCAGATTAATTACT660                CCCTTGTTTGTTGGATCCACTCCCGAAACGTTTTACTCCTTAACTTATTGAAATTGTTTG720                AAGACAGGATTTCAGTGTCACAATGGGAGCTCAAGTATCATCCCAAAAAGTA772                        MetGlyAlaGlnValSerSerGlnLysVal                                                 1510                                                                           GGCGCTCACGAGAATTCTAACCGAGCCTACGGTGGTTCTACGATCAAC820                            GlyAlaHisGluAsnSerAsnArgAlaTyrGlyGlySerThrIleAsn                               152025                                                                         TACACCACAATTAATTATTATAAAGATTCCGCAAGTAATGCGGCGTCC868                            TyrThrThrIleAsnTyrTyrLysAspSerAlaSerAsnAlaAlaSer                               303540                                                                         AAACAAGATTACTCACAGGATCCATCAAAATTCACCGAGCCACTAAAG916                            LysGlnAspTyrSerGlnAspProSerLysPheThrGluProLeuLys                               455055                                                                         GACGTGCTCATAAAAACAGCTCCAGCACTCAATTCACCAAATGTGGAA964                            AspValLeuIleLysThrAlaProAlaLeuAsnSerProAsnValGlu                               606570                                                                         GCGTGTGGGTATAGTGATAGAGTGTTGCAACTCACTTTAGGCAATTCC1012                           AlaCysGlyTyrSerAspArgValLeuGlnLeuThrLeuGlyAsnSer                               75808590                                                                       ACTATTACTACACAGGAGGCAGCAAATTCAGTAGTGGCTTACGGACGT1060                           ThrIleThrThrGlnGluAlaAlaAsnSerValValAlaTyrGlyArg                               95100105                                                                       TGGCCTGAGTTTATTAGAGATGACGAAGCAAACCCGGTGGACCAACCA1108                           TrpProGluPheIleArgAspAspGluAlaAsnProValAspGlnPro                               110115120                                                                      ACTGAACCAGATGTGGCTACATGCAGATTCTACACACTAGACACTGTA1156                           ThrGluProAspValAlaThrCysArgPheTyrThrLeuAspThrVal                               125130135                                                                      ATGTGGGGTAAGGAGTCGAAAGGCTGGTGGTGGAAGTTACCTGACGCA1204                           MetTrpGlyLysGluSerLysGlyTrpTrpTrpLysLeuProAspAla                               140145150                                                                      CTGAGAGACATGGGTCTGTTTGGACAAAACATGTATTACCACTACCTA1252                           LeuArgAspMetGlyLeuPheGlyGlnAsnMetTyrTyrHisTyrLeu                               155160165170                                                                   GGAAGATCCGGGTACACTGTGCACGTGCAGTGTAATGCATCCAAATTT1300                           GlyArgSerGlyTyrThrValHisValGlnCysAsnAlaSerLysPhe                               175180185                                                                      CACCAAGGTGCACTCGGGGTGTTTGCGATTCCTGAGTATTGTCTGGCG1348                           HisGlnGlyAlaLeuGlyValPheAlaIleProGluTyrCysLeuAla                               190195200                                                                      GGTGACAGTGACAAGCAAAGGTACACTAGTTATGCAAATGCGAATCCA1396                           GlyAspSerAspLysGlnArgTyrThrSerTyrAlaAsnAlaAsnPro                               205210215                                                                      GGTGAAAGAGGGGGAAAATTTTACTCCCAATTCAACAAGGATAACGCA1444                           GlyGluArgGlyGlyLysPheTyrSerGlnPheAsnLysAspAsnAla                               220225230                                                                      GTAACATCCCCAAAAAGAGAGTTCTGCCCAGTGGATTATCTCCTGGGA1492                           ValThrSerProLysArgGluPheCysProValAspTyrLeuLeuGly                               235240245250                                                                   TGTGGGGTGTTACTGGGAAATGCCTTTGTATACCCACATCAAATCATT1540                           CysGlyValLeuLeuGlyAsnAlaPheValTyrProHisGlnIleIle                               255260265                                                                      AATCTGAGGACCAACAACAGCGCAACTATTGTCCTACCATATGTGAAT1588                           AsnLeuArgThrAsnAsnSerAlaThrIleValLeuProTyrValAsn                               270275280                                                                      GCTTTGGCCATTGATTCAATGGTTAAACACAACAACTGGGGCATTGCC1636                           AlaLeuAlaIleAspSerMetValLysHisAsnAsnTrpGlyIleAla                               285290295                                                                      ATTCTGCCCTTATCACCGCTGGATTTTGCTCAAGATTCATCAGTTGAA1684                           IleLeuProLeuSerProLeuAspPheAlaGlnAspSerSerValGlu                               300305310                                                                      ATTCCAATTACTGTGACAATTGCCCCAATGTGTAGCGAGTTCAACGGC1732                           IleProIleThrValThrIleAlaProMetCysSerGluPheAsnGly                               315320325330                                                                   CTTCGCAACGTGACTGCACCTAAATTTCAAGGACTACCAGTGTTGAAC1780                           LeuArgAsnValThrAlaProLysPheGlnGlyLeuProValLeuAsn                               335340345                                                                      ACTCCTGGTAGTAACCAGTACCTGACGTCAGACAACCACCAATCACCA1828                           ThrProGlySerAsnGlnTyrLeuThrSerAspAsnHisGlnSerPro                               350355360                                                                      TGCGCAATCCCAGAATTTGATGTCACTCCGCCTATTGATATCCCAGGT1876                           CysAlaIleProGluPheAspValThrProProIleAspIleProGly                               365370375                                                                      GAGGTTAAAAACATGATGGAGCTCGCCGAGATAGACACCATGATTCCT1924                           GluValLysAsnMetMetGluLeuAlaGluIleAspThrMetIlePro                               380385390                                                                      CTCAATTTGGAGAGCACCAAGAGAAACACAATGGACATGTACAGAGTT1972                           LeuAsnLeuGluSerThrLysArgAsnThrMetAspMetTyrArgVal                               395400405410                                                                   ACTCTGAGCGACAGTGCCGATCTATCGCAACCAATTTTGTGCTTGTCA2020                           ThrLeuSerAspSerAlaAspLeuSerGlnProIleLeuCysLeuSer                               415420425                                                                      CTATCCCCAGCATTTGATCCGCGCTTGTCACACACCATGCTTGGGGAA2068                           LeuSerProAlaPheAspProArgLeuSerHisThrMetLeuGlyGlu                               430435440                                                                      GTACTGAACTATTATACTCATTGGGCCGGGTCCTTGAAATTTACCTTC2116                           ValLeuAsnTyrTyrThrHisTrpAlaGlySerLeuLysPheThrPhe                               445450455                                                                      CTGTTCTGTGGTTCAATGATGGCTACGGGGAAAATCCTAGTGGCCTAT2164                           LeuPheCysGlySerMetMetAlaThrGlyLysIleLeuValAlaTyr                               460465470                                                                      GCACCACCAGGTGCACAACCCCCCACCAGCCGTAAGGAGGCTATGTTG2212                           AlaProProGlyAlaGlnProProThrSerArgLysGluAlaMetLeu                               475480485490                                                                   GGCACACATGTCATTTGGGATCTTGGCCTGCAATCATCTTGTACTATG2260                           GlyThrHisValIleTrpAspLeuGlyLeuGlnSerSerCysThrMet                               495500505                                                                      GTGGTGCCGTGGATTAGTAATGTGACATACAGACAGACTACACAAGAT2308                           ValValProTrpIleSerAsnValThrTyrArgGlnThrThrGlnAsp                               510515520                                                                      AGTTTCACTGAGGGCGGATATATCAGCATGTTCTACCAAACAAGAATT2356                           SerPheThrGluGlyGlyTyrIleSerMetPheTyrGlnThrArgIle                               525530535                                                                      GTGGTGCCACTGTCCACCCCTAAGAGTATGAGCATGCTGGGGTTTGTG2404                           ValValProLeuSerThrProLysSerMetSerMetLeuGlyPheVal                               540545550                                                                      TCAGCCTGTAATGATTTCAGTGTGCGATTGCTGCGAGACACCACTCAC2452                           SerAlaCysAsnAspPheSerValArgLeuLeuArgAspThrThrHis                               555560565570                                                                   ATTTCACAATCTGCGCTTCCACAGGGTATTGAAGATTTGACTTCTGAA2500                           IleSerGlnSerAlaLeuProGlnGlyIleGluAspLeuThrSerGlu                               575580585                                                                      GTTGCACAGGGCGCCCTAACTTTGTCACTCCCGAAGCAACAGGATAGC2548                           ValAlaGlnGlyAlaLeuThrLeuSerLeuProLysGlnGlnAspSer                               590595600                                                                      TTACCTGATACTAAGGCCAGTGGCCCGGCGCATTCCAAGGAGGTACCT2596                           LeuProAspThrLysAlaSerGlyProAlaHisSerLysGluValPro                               605610615                                                                      GCACTCACTGCAGTCGAGACTGGAGCCACCAATCCTCTGGCACCATCC2644                           AlaLeuThrAlaValGluThrGlyAlaThrAsnProLeuAlaProSer                               620625630                                                                      GACACAGTTCAAACGCGCCACGTAGTCCAACGACGCAGCAGGTCAGAG2692                           AspThrValGlnThrArgHisValValGlnArgArgSerArgSerGlu                               635640645650                                                                   TCCACAATAGAATCATTCTTCGCACGCGGGGCGTGCGTCGCTATTATT2740                           SerThrIleGluSerPhePheAlaArgGlyAlaCysValAlaIleIle                               655660665                                                                      GAGGTGGACAATGAACAACCAACCACCCGGGCACAGAAACTATTTGCC2788                           GluValAspAsnGluGlnProThrThrArgAlaGlnLysLeuPheAla                               670675680                                                                      ATGTGGCGCATTACATACAAAGATACAGTGCAGTTGCGCCGTAAGTTG2836                           MetTrpArgIleThrTyrLysAspThrValGlnLeuArgArgLysLeu                               685690695                                                                      GAGTTTTTCACATACTCTCGTTTTGACATGGAATTCACCTTCGTGGTA2884                           GluPhePheThrTyrSerArgPheAspMetGluPheThrPheValVal                               700705710                                                                      ACCGCCAACTTCACCAACGCTAATAATGGGCATGCACTCAACCAGGTG2932                           ThrAlaAsnPheThrAsnAlaAsnAsnGlyHisAlaLeuAsnGlnVal                               715720725730                                                                   TACCAGATAATGTACATCCCCCCAGGGGCACCCACACCAAAGTCATGG2980                           TyrGlnIleMetTyrIleProProGlyAlaProThrProLysSerTrp                               735740745                                                                      GACGACTACACTTGGCAAACATCTTCCAACCCGTCCATATTTTACACC3028                           AspAspTyrThrTrpGlnThrSerSerAsnProSerIlePheTyrThr                               750755760                                                                      TATGGGGCTGCCCCGGCGCGAATCTCAGTGCCATACGTGGGGTTAGCC3076                           TyrGlyAlaAlaProAlaArgIleSerValProTyrValGlyLeuAla                               765770775                                                                      AATGCTTACTCGCACTTTTACGACGGCTTCGCCAAGGTGCCATTGAAG3124                           AsnAlaTyrSerHisPheTyrAspGlyPheAlaLysValProLeuLys                               780785790                                                                      ACAGATGCCAATGACCAGATTGGTGATTCCTTGTACAGCGCCATGACA3172                           ThrAspAlaAsnAspGlnIleGlyAspSerLeuTyrSerAlaMetThr                               795800805810                                                                   GTTGATGACTTTGGTGTATTGGCAGTTCGTGTTGTCAATGATCACAAC3220                           ValAspAspPheGlyValLeuAlaValArgValValAsnAspHisAsn                               815820825                                                                      CCCACTAAAGTAACCTCCAAAGTCCGCATTTACATGAAACCCAAACAC3268                           ProThrLysValThrSerLysValArgIleTyrMetLysProLysHis                               830835840                                                                      GTACGTGTCTGGTGCCCTAGACCGCCGCGCGCGGTACCTTATTATGGA3316                           ValArgValTrpCysProArgProProArgAlaValProTyrTyrGly                               845850855                                                                      CCAGGGGTGGACTATAGGAACAACTTGGACCCCTTATCTGAGAAAGGT3364                           ProGlyValAspTyrArgAsnAsnLeuAspProLeuSerGluLysGly                               860865870                                                                      TTGACCACATATGGCTTTGGGCATCAGAATAAAGCTGTGTACACTGCT3412                           LeuThrThrTyrGlyPheGlyHisGlnAsnLysAlaValTyrThrAla                               875880885890                                                                   GGTTACAAGATCTGCAACTACCATCTCGCCACTAAGGAGGATTTACAA3460                           GlyTyrLysIleCysAsnTyrHisLeuAlaThrLysGluAspLeuGln                               895900905                                                                      AATGCTGTAAGCATCATGTGGAATAGAGACCTCTTGGTTGTTGAATCA3508                           AsnAlaValSerIleMetTrpAsnArgAspLeuLeuValValGluSer                               910915920                                                                      AAAGCTCAAGGTACCGACTCAATAGCAAGGTGCAATTGCAATGCAGGG3556                           LysAlaGlnGlyThrAspSerIleAlaArgCysAsnCysAsnAlaGly                               925930935                                                                      GTGTACTATTGTGAGTCCAGAAGGAAATACTACCCTGTGTCGTTTGTG3604                           ValTyrTyrCysGluSerArgArgLysTyrTyrProValSerPheVal                               940945950                                                                      GGACCCACCTTCCAATACATGGAGGCTAATGACTACTACCCAGCTAGA3652                           GlyProThrPheGlnTyrMetGluAlaAsnAspTyrTyrProAlaArg                               955960965970                                                                   TACCAATCCCACATGTTAATCGGGCACGGCTTTGCCTCACCAGGTGAC3700                           TyrGlnSerHisMetLeuIleGlyHisGlyPheAlaSerProGlyAsp                               975980985                                                                      TGTGGTGGTATCCTTAGGTGTCAACATGGCGTCATCGGAATCGTGACA3748                           CysGlyGlyIleLeuArgCysGlnHisGlyValIleGlyIleValThr                               9909951000                                                                     GCTGGTGGAGAGGGATTAGTCGCATTCTCTGACATAAGGGACTTGTAT3796                           AlaGlyGlyGluGlyLeuValAlaPheSerAspIleArgAspLeuTyr                               100510101015                                                                   GCTTACGAGGAAGAGGCCATGGAGCAGGGCATTTCAAACTATATTGAG3844                           AlaTyrGluGluGluAlaMetGluGlnGlyIleSerAsnTyrIleGlu                               102010251030                                                                   TCACTCGGTGCTGCGTTCGGTAGTGGGTTCACTCAGCAAATAGGGGAT3892                           SerLeuGlyAlaAlaPheGlySerGlyPheThrGlnGlnIleGlyAsp                               1035104010451050                                                               AAGATATCAGAACTAACCAGCATGGTGACCAGCACGATTACAGAGAAG3940                           LysIleSerGluLeuThrSerMetValThrSerThrIleThrGluLys                               105510601065                                                                   CTACTTAAAAACCTAATCAAAATTATTTCATCTCTGGTGATTATCACT3988                           LeuLeuLysAsnLeuIleLysIleIleSerSerLeuValIleIleThr                               107010751080                                                                   AGAAATTACGAAGATACCACCACAGTGCTCGCCACTCTAGCTCTTCTT4036                           ArgAsnTyrGluAspThrThrThrValLeuAlaThrLeuAlaLeuLeu                               108510901095                                                                   GGGTGTGATGTTTCACCGTGGCAATGGCTGAAGAAGAAAGCATGTGAC4084                           GlyCysAspValSerProTrpGlnTrpLeuLysLysLysAlaCysAsp                               110011051110                                                                   ACTTTGGAGATTCCCTATGTTATTAGACAGGGTGATAGTTGGTTGAAA4132                           ThrLeuGluIleProTyrValIleArgGlnGlyAspSerTrpLeuLys                               1115112011251130                                                               AAATTTACTGAGGCGTGCAACGCAGCTAAGGGGTTGGAATGGGTGTCC4180                           LysPheThrGluAlaCysAsnAlaAlaLysGlyLeuGluTrpValSer                               113511401145                                                                   AACAAAATCTCAAAATTTATTGACTGGTTGAGAGAAAGAATCATCCCA4228                           AsnLysIleSerLysPheIleAspTrpLeuArgGluArgIleIlePro                               115011551160                                                                   CAAGCCAGGGACAAGCTTGAGTTTGTAACCAAATTGAAACAGTTGGAA4276                           GlnAlaArgAspLysLeuGluPheValThrLysLeuLysGlnLeuGlu                               116511701175                                                                   ATGCTAGAGAATCAGATATCCACAATACACCAATCTTGTCCAAGTCAG4324                           MetLeuGluAsnGlnIleSerThrIleHisGlnSerCysProSerGln                               118011851190                                                                   GAACACCAGGAAATTTTGTTCAACAATGTACGCTGGTTGTCCATTCAA4372                           GluHisGlnGluIleLeuPheAsnAsnValArgTrpLeuSerIleGln                               1195120012051210                                                               TCCAAGAGATTCGCTCCATTGTACGCACTTGAGGCCAAGAGAATACAA4420                           SerLysArgPheAlaProLeuTyrAlaLeuGluAlaLysArgIleGln                               121512201225                                                                   AAGTTGGAACACACCATTAATAATTACATACAGTTCAAGAGCAAACAC4468                           LysLeuGluHisThrIleAsnAsnTyrIleGlnPheLysSerLysHis                               123012351240                                                                   CGTATTGAGCCAGTATGTTTGTTAGTGCATGGGAGCCCAGGTACAGGA4516                           ArgIleGluProValCysLeuLeuValHisGlySerProGlyThrGly                               124512501255                                                                   AAATCAGTTGCGACTAACCTAATTGCTAGAGCCATAGCTGAGAAAGAG4564                           LysSerValAlaThrAsnLeuIleAlaArgAlaIleAlaGluLysGlu                               126012651270                                                                   AACACCTCCACCTACTCGCTACCACCGGACCCGTCTCACTTTGATGGA4612                           AsnThrSerThrTyrSerLeuProProAspProSerHisPheAspGly                               1275128012851290                                                               TACAAACAACAAGGTGTGGTTATCATGGACGACCTAAACCAAAACCCG4660                           TyrLysGlnGlnGlyValValIleMetAspAspLeuAsnGlnAsnPro                               129513001305                                                                   GATGGGGCAGATATGAAGCTCTTTTGTCAAATGGTGTCCACTGTGGAG4708                           AspGlyAlaAspMetLysLeuPheCysGlnMetValSerThrValGlu                               131013151320                                                                   TTTATCCCACCTATGGCCTCGCTGGAAGAGAAAGGCATTCTGTTCACA4756                           PheIleProProMetAlaSerLeuGluGluLysGlyIleLeuPheThr                               132513301335                                                                   TCCAACTATGTTTTAGCCTCCACCAACTCCAGTCGCATCACACCACCT4804                           SerAsnTyrValLeuAlaSerThrAsnSerSerArgIleThrProPro                               134013451350                                                                   ACAGTAGCCCACAGTGACGCTCTGGCCAGGAGGTTCGCTTTCGATATG4852                           ThrValAlaHisSerAspAlaLeuAlaArgArgPheAlaPheAspMet                               1355136013651370                                                               GATATTCAAGTGATGGGCGAGTACTCCAGAGATGGTAAACTCAACATG4900                           AspIleGlnValMetGlyGluTyrSerArgAspGlyLysLeuAsnMet                               137513801385                                                                   GCAATGGCTACTGAGACGTGCAAGGACTGCCACCAACCAGCAAACTTC4948                           AlaMetAlaThrGluThrCysLysAspCysHisGlnProAlaAsnPhe                               139013951400                                                                   AAAAGATGCTGTCCTTTAGTGTGTGGTAAGGCAATTCAGTTAATGGAC4996                           LysArgCysCysProLeuValCysGlyLysAlaIleGlnLeuMetAsp                               140514101415                                                                   AAATCTTCCAGAGTTAGGTACAGTGTTGACCAGATTACTACAATGATT5044                           LysSerSerArgValArgTyrSerValAspGlnIleThrThrMetIle                               142014251430                                                                   ATCAACGAGAGAAACAGAAGATCTAACATTGGCAATTGCATGGAGGCT5092                           IleAsnGluArgAsnArgArgSerAsnIleGlyAsnCysMetGluAla                               1435144014451450                                                               TTGTTCCAAGGACCACTCCAGTACAAAGACCTGAAAATTGACATCAAG5140                           LeuPheGlnGlyProLeuGlnTyrLysAspLeuLysIleAspIleLys                               145514601465                                                                   ACGAGGCCCCCCCCTGAATGCATCAATGATCTGCTTCAAGCAGTTGAC5188                           ThrArgProProProGluCysIleAsnAspLeuLeuGlnAlaValAsp                               147014751480                                                                   TCCCAGGAAGTGAGGGATTATTGTGAAAAGAAAGGATGGATCGTCAAC5236                           SerGlnGluValArgAspTyrCysGluLysLysGlyTrpIleValAsn                               148514901495                                                                   ATCACTAGCCAAGTTCAAACAGAGAGAAACATTAACCGAGCAATGACC5284                           IleThrSerGlnValGlnThrGluArgAsnIleAsnArgAlaMetThr                               150015051510                                                                   ATTTTGCAGGCAGTGACAACTTTCGCCGCAGTGGCTGGTGTCGTGTAC5332                           IleLeuGlnAlaValThrThrPheAlaAlaValAlaGlyValValTyr                               1515152015251530                                                               GTCATGTACAAGTTATTCGCTGGACACCAGGGAGCATACACTGGTCTG5380                           ValMetTyrLysLeuPheAlaGlyHisGlnGlyAlaTyrThrGlyLeu                               153515401545                                                                   CCAAACAAAAGACCCAATGTGCCCACCATTAGAGCAGCAAAAGTGCAA5428                           ProAsnLysArgProAsnValProThrIleArgAlaAlaLysValGln                               155015551560                                                                   GGGCCTGGGTTTGACTATGCAGTGGCTATGGCTAAAAGAAACATTGTT5476                           GlyProGlyPheAspTyrAlaValAlaMetAlaLysArgAsnIleVal                               156515701575                                                                   ACAGCAACTACTAGCAAAGGGGAGTTCACAATGCTAGGAGTCCACGAC5524                           ThrAlaThrThrSerLysGlyGluPheThrMetLeuGlyValHisAsp                               158015851590                                                                   AACGTGGCCATTTTACCAACTCATGCCTCACCTGGTGAGAGTATTGTA5572                           AsnValAlaIleLeuProThrHisAlaSerProGlyGluSerIleVal                               1595160016051610                                                               ATTGATGGCAAAGAGGTTGAAATCCTAGACGCTAAAGCCCTCGAAGAT5620                           IleAspGlyLysGluValGluIleLeuAspAlaLysAlaLeuGluAsp                               161516201625                                                                   CAGGCAGGCACTAATCTGGAAATCACCATAATAACCCTCAAAAGAAAT5668                           GlnAlaGlyThrAsnLeuGluIleThrIleIleThrLeuLysArgAsn                               163016351640                                                                   GAAAAGTTCAGAGATATCAGACAACACATACCCACTCAAATCACCGAG5716                           GluLysPheArgAspIleArgGlnHisIleProThrGlnIleThrGlu                               164516501655                                                                   ACGAATGATGGAGTTCTGATTGTGAACACTAGTAAGTACCCCAACATG5764                           ThrAsnAspGlyValLeuIleValAsnThrSerLysTyrProAsnMet                               166016651670                                                                   TATGTTCCTGTCGGTGCTGTGACTGAGCAGGGATACCTAAATCTCGGT5812                           TyrValProValGlyAlaValThrGluGlnGlyTyrLeuAsnLeuGly                               1675168016851690                                                               GGGCGCCAGACTGCTCGTATTCTAATGTACAACTTTCCAACCAGAGCT5860                           GlyArgGlnThrAlaArgIleLeuMetTyrAsnPheProThrArgAla                               169517001705                                                                   GGTCAGTGTGGTGGAGTCATCACATGCACTGGGAAAGTCATCGGGATG5908                           GlyGlnCysGlyGlyValIleThrCysThrGlyLysValIleGlyMet                               171017151720                                                                   CACGTTGGTGGGAATGGTTCACATGGGTTTGCAGCGGCCCTGAAGCGG5956                           HisValGlyGlyAsnGlySerHisGlyPheAlaAlaAlaLeuLysArg                               172517301735                                                                   TCATACTTCACTCAGAGCCAAGGTGAAATCCAGTGGATGAGACCATCA6004                           SerTyrPheThrGlnSerGlnGlyGluIleGlnTrpMetArgProSer                               174017451750                                                                   AAGGAGGCAGGGTATCCAATTATAAACGCCCCAACCAAGACCAAGCTC6052                           LysGluAlaGlyTyrProIleIleAsnAlaProThrLysThrLysLeu                               1755176017651770                                                               GAGCCCAGTGCTTTCCACTATGTGTTTGAAGGAGTAAAGGAACCAGCA6100                           GluProSerAlaPheHisTyrValPheGluGlyValLysGluProAla                               177517801785                                                                   GTCCTCACAAAGAATGATCCCAGACTCAAAACAGACTTTGAAGAAGCA6148                           ValLeuThrLysAsnAspProArgLeuLysThrAspPheGluGluAla                               179017951800                                                                   ATCTTCTCTAAGTATGTAGGGAACAAGATCACTGAGGTGGATGAGTAC6196                           IlePheSerLysTyrValGlyAsnLysIleThrGluValAspGluTyr                               180518101815                                                                   ATGAAAGAGGCAGTGGACCATTATGCTGGACAACTTATGTCGCTGGAT6244                           MetLysGluAlaValAspHisTyrAlaGlyGlnLeuMetSerLeuAsp                               182018251830                                                                   ATCAGCACAGAGCAAATGTGTCTAGAAGACGCCATGTATGGTACTGAT6292                           IleSerThrGluGlnMetCysLeuGluAspAlaMetTyrGlyThrAsp                               1835184018451850                                                               GGTCTGGAGGCGCTAGATCTGTCTACCAGTGCCGGGTACCCCTACGTG6340                           GlyLeuGluAlaLeuAspLeuSerThrSerAlaGlyTyrProTyrVal                               185518601865                                                                   GCAATGGGGAAGAAGAAGAGAGATATCCTAAACAAGCAAACCAGAGAC6388                           AlaMetGlyLysLysLysArgAspIleLeuAsnLysGlnThrArgAsp                               187018751880                                                                   ACCAAAGAAATGCAAAGACTTTTGGACGCTTACGGAATCAACCTACCA6436                           ThrLysGluMetGlnArgLeuLeuAspAlaTyrGlyIleAsnLeuPro                               188518901895                                                                   TTAGTGACATATGTCAAGGACGAGCTGAGGTCCAAAACAAAAGTGGAA6484                           LeuValThrTyrValLysAspGluLeuArgSerLysThrLysValGlu                               190019051910                                                                   CAGGGAAAATCCAGACTGATTGAAGCTTCCAGTCTAAATGACTCAGTG6532                           GlnGlyLysSerArgLeuIleGluAlaSerSerLeuAsnAspSerVal                               1915192019251930                                                               GCCATGAGAATGGCATTTGGAAACCTTTATGCAGCATTCCACAGGAAT6580                           AlaMetArgMetAlaPheGlyAsnLeuTyrAlaAlaPheHisArgAsn                               193519401945                                                                   CCAGGGGTCGTCACTGGTAGTGCAGTTGGATGCGATCCAGACCTATTC6628                           ProGlyValValThrGlySerAlaValGlyCysAspProAspLeuPhe                               195019551960                                                                   TGGAGCAAGATCCCAGTGTTGATGGAAGAAAAGCTATTTGCCTTTGAT6676                           TrpSerLysIleProValLeuMetGluGluLysLeuPheAlaPheAsp                               196519701975                                                                   TACACAGGATACGACGCATCACTTAGCCCAGCTTGGTTTGAGGCACTC6724                           TyrThrGlyTyrAspAlaSerLeuSerProAlaTrpPheGluAlaLeu                               198019851990                                                                   AAGATGGTGTTAGAGAAAATTGGTTTTGGAGATAGAGTGGATTACATA6772                           LysMetValLeuGluLysIleGlyPheGlyAspArgValAspTyrIle                               1995200020052010                                                               GACTACCTTAACCATTCACACCACTTGTACAAAAACAAGATATATTGT6820                           AspTyrLeuAsnHisSerHisHisLeuTyrLysAsnLysIleTyrCys                               201520202025                                                                   GTTAAGGGCGGCATGCCATCTGGCTGCTCCGGCACTTCAATTTTTAAT6868                           ValLysGlyGlyMetProSerGlyCysSerGlyThrSerIlePheAsn                               203020352040                                                                   TCAATGATTAACAATTTGATCATTAGGACGCTTTTACTGAAAACCTAC6916                           SerMetIleAsnAsnLeuIleIleArgThrLeuLeuLeuLysThrTyr                               204520502055                                                                   AAGGGCATAGATTTGGACCACTTAAAAATGATTGCCTATGGTGACGAT6964                           LysGlyIleAspLeuAspHisLeuLysMetIleAlaTyrGlyAspAsp                               206020652070                                                                   GTAATAGCTTCCTATCCCCATGAGGTTGACGCTAGTCTCCTAGCCCAA7012                           ValIleAlaSerTyrProHisGluValAspAlaSerLeuLeuAlaGln                               2075208020852090                                                               TCAGGAAAAGACTATGGACTAACCATGACTCCGGCAGATAAATCTGCC7060                           SerGlyLysAspTyrGlyLeuThrMetThrProAlaAspLysSerAla                               209521002105                                                                   ACTTTTGAGACAGTCACATGGGAGAATGTAACTTTCTTGAAAAGATTC7108                           ThrPheGluThrValThrTrpGluAsnValThrPheLeuLysArgPhe                               211021152120                                                                   TTCAGAGCAGATGAGAAATACCCCTTCCTCATACATCCAGTAATGCCA7156                           PheArgAlaAspGluLysTyrProPheLeuIleHisProValMetPro                               212521302135                                                                   ATGAAGGAAATTCATGAATCAATCAGATGGACAAAAGATCCTCGGAAT7204                           MetLysGluIleHisGluSerIleArgTrpThrLysAspProArgAsn                               214021452150                                                                   ACGCAGGACCATGTACGCTCCTTGTGTCTATTGGCTTGGCACAACGGG7252                           ThrGlnAspHisValArgSerLeuCysLeuLeuAlaTrpHisAsnGly                               2155216021652170                                                               GAAGAAGAATACAACAAATTTTTAGCTAAAATTAGGAGTGTGCCAATC7300                           GluGluGluTyrAsnLysPheLeuAlaLysIleArgSerValProIle                               217521802185                                                                   GGAAGAGCTTTGTTGCTCCCAGAGTACTCAACATTGTACCGCCGTTGG7348                           GlyArgAlaLeuLeuLeuProGluTyrSerThrLeuTyrArgArgTrp                               219021952200                                                                   CTTGACTCATTTTAGTAACCCTACCTCAGTCGAATTGGATTGGGTCATACTGT7401                      LeuAspSerPhe                                                                   2205                                                                           TGTAGGGGTAAATTTTTCTTTAATTCGGAGG7432                                            (2) INFORMATION FOR SEQ ID NO:2:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 2206 amino acids                                                   (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:                                        MetGlyAlaGlnValSerSerGlnLysValGlyAlaHisGluAsnSer                               151015                                                                         AsnArgAlaTyrGlyGlySerThrIleAsnTyrThrThrIleAsnTyr                               202530                                                                         TyrLysAspSerAlaSerAsnAlaAlaSerLysGlnAspTyrSerGln                               354045                                                                         AspProSerLysPheThrGluProLeuLysAspValLeuIleLysThr                               505560                                                                         AlaProAlaLeuAsnSerProAsnValGluAlaCysGlyTyrSerAsp                               65707580                                                                       ArgValLeuGlnLeuThrLeuGlyAsnSerThrIleThrThrGlnGlu                               859095                                                                         AlaAlaAsnSerValValAlaTyrGlyArgTrpProGluPheIleArg                               100105110                                                                      AspAspGluAlaAsnProValAspGlnProThrGluProAspValAla                               115120125                                                                      ThrCysArgPheTyrThrLeuAspThrValMetTrpGlyLysGluSer                               130135140                                                                      LysGlyTrpTrpTrpLysLeuProAspAlaLeuArgAspMetGlyLeu                               145150155160                                                                   PheGlyGlnAsnMetTyrTyrHisTyrLeuGlyArgSerGlyTyrThr                               165170175                                                                      ValHisValGlnCysAsnAlaSerLysPheHisGlnGlyAlaLeuGly                               180185190                                                                      ValPheAlaIleProGluTyrCysLeuAlaGlyAspSerAspLysGln                               195200205                                                                      ArgTyrThrSerTyrAlaAsnAlaAsnProGlyGluArgGlyGlyLys                               210215220                                                                      PheTyrSerGlnPheAsnLysAspAsnAlaValThrSerProLysArg                               225230235240                                                                   GluPheCysProValAspTyrLeuLeuGlyCysGlyValLeuLeuGly                               245250255                                                                      AsnAlaPheValTyrProHisGlnIleIleAsnLeuArgThrAsnAsn                               260265270                                                                      SerAlaThrIleValLeuProTyrValAsnAlaLeuAlaIleAspSer                               275280285                                                                      MetValLysHisAsnAsnTrpGlyIleAlaIleLeuProLeuSerPro                               290295300                                                                      LeuAspPheAlaGlnAspSerSerValGluIleProIleThrValThr                               305310315320                                                                   IleAlaProMetCysSerGluPheAsnGlyLeuArgAsnValThrAla                               325330335                                                                      ProLysPheGlnGlyLeuProValLeuAsnThrProGlySerAsnGln                               340345350                                                                      TyrLeuThrSerAspAsnHisGlnSerProCysAlaIleProGluPhe                               355360365                                                                      AspValThrProProIleAspIleProGlyGluValLysAsnMetMet                               370375380                                                                      GluLeuAlaGluIleAspThrMetIleProLeuAsnLeuGluSerThr                               385390395400                                                                   LysArgAsnThrMetAspMetTyrArgValThrLeuSerAspSerAla                               405410415                                                                      AspLeuSerGlnProIleLeuCysLeuSerLeuSerProAlaPheAsp                               420425430                                                                      ProArgLeuSerHisThrMetLeuGlyGluValLeuAsnTyrTyrThr                               435440445                                                                      HisTrpAlaGlySerLeuLysPheThrPheLeuPheCysGlySerMet                               450455460                                                                      MetAlaThrGlyLysIleLeuValAlaTyrAlaProProGlyAlaGln                               465470475480                                                                   ProProThrSerArgLysGluAlaMetLeuGlyThrHisValIleTrp                               485490495                                                                      AspLeuGlyLeuGlnSerSerCysThrMetValValProTrpIleSer                               500505510                                                                      AsnValThrTyrArgGlnThrThrGlnAspSerPheThrGluGlyGly                               515520525                                                                      TyrIleSerMetPheTyrGlnThrArgIleValValProLeuSerThr                               530535540                                                                      ProLysSerMetSerMetLeuGlyPheValSerAlaCysAsnAspPhe                               545550555560                                                                   SerValArgLeuLeuArgAspThrThrHisIleSerGlnSerAlaLeu                               565570575                                                                      ProGlnGlyIleGluAspLeuThrSerGluValAlaGlnGlyAlaLeu                               580585590                                                                      ThrLeuSerLeuProLysGlnGlnAspSerLeuProAspThrLysAla                               595600605                                                                      SerGlyProAlaHisSerLysGluValProAlaLeuThrAlaValGlu                               610615620                                                                      ThrGlyAlaThrAsnProLeuAlaProSerAspThrValGlnThrArg                               625630635640                                                                   HisValValGlnArgArgSerArgSerGluSerThrIleGluSerPhe                               645650655                                                                      PheAlaArgGlyAlaCysValAlaIleIleGluValAspAsnGluGln                               660665670                                                                      ProThrThrArgAlaGlnLysLeuPheAlaMetTrpArgIleThrTyr                               675680685                                                                      LysAspThrValGlnLeuArgArgLysLeuGluPhePheThrTyrSer                               690695700                                                                      ArgPheAspMetGluPheThrPheValValThrAlaAsnPheThrAsn                               705710715720                                                                   AlaAsnAsnGlyHisAlaLeuAsnGlnValTyrGlnIleMetTyrIle                               725730735                                                                      ProProGlyAlaProThrProLysSerTrpAspAspTyrThrTrpGln                               740745750                                                                      ThrSerSerAsnProSerIlePheTyrThrTyrGlyAlaAlaProAla                               755760765                                                                      ArgIleSerValProTyrValGlyLeuAlaAsnAlaTyrSerHisPhe                               770775780                                                                      TyrAspGlyPheAlaLysValProLeuLysThrAspAlaAsnAspGln                               785790795800                                                                   IleGlyAspSerLeuTyrSerAlaMetThrValAspAspPheGlyVal                               805810815                                                                      LeuAlaValArgValValAsnAspHisAsnProThrLysValThrSer                               820825830                                                                      LysValArgIleTyrMetLysProLysHisValArgValTrpCysPro                               835840845                                                                      ArgProProArgAlaValProTyrTyrGlyProGlyValAspTyrArg                               850855860                                                                      AsnAsnLeuAspProLeuSerGluLysGlyLeuThrThrTyrGlyPhe                               865870875880                                                                   GlyHisGlnAsnLysAlaValTyrThrAlaGlyTyrLysIleCysAsn                               885890895                                                                      TyrHisLeuAlaThrLysGluAspLeuGlnAsnAlaValSerIleMet                               900905910                                                                      TrpAsnArgAspLeuLeuValValGluSerLysAlaGlnGlyThrAsp                               915920925                                                                      SerIleAlaArgCysAsnCysAsnAlaGlyValTyrTyrCysGluSer                               930935940                                                                      ArgArgLysTyrTyrProValSerPheValGlyProThrPheGlnTyr                               945950955960                                                                   MetGluAlaAsnAspTyrTyrProAlaArgTyrGlnSerHisMetLeu                               965970975                                                                      IleGlyHisGlyPheAlaSerProGlyAspCysGlyGlyIleLeuArg                               980985990                                                                      CysGlnHisGlyValIleGlyIleValThrAlaGlyGlyGluGlyLeu                               99510001005                                                                    ValAlaPheSerAspIleArgAspLeuTyrAlaTyrGluGluGluAla                               101010151020                                                                   MetGluGlnGlyIleSerAsnTyrIleGluSerLeuGlyAlaAlaPhe                               1025103010351040                                                               GlySerGlyPheThrGlnGlnIleGlyAspLysIleSerGluLeuThr                               104510501055                                                                   SerMetValThrSerThrIleThrGluLysLeuLeuLysAsnLeuIle                               106010651070                                                                   LysIleIleSerSerLeuValIleIleThrArgAsnTyrGluAspThr                               107510801085                                                                   ThrThrValLeuAlaThrLeuAlaLeuLeuGlyCysAspValSerPro                               109010951100                                                                   TrpGlnTrpLeuLysLysLysAlaCysAspThrLeuGluIleProTyr                               1105111011151120                                                               ValIleArgGlnGlyAspSerTrpLeuLysLysPheThrGluAlaCys                               112511301135                                                                   AsnAlaAlaLysGlyLeuGluTrpValSerAsnLysIleSerLysPhe                               114011451150                                                                   IleAspTrpLeuArgGluArgIleIleProGlnAlaArgAspLysLeu                               115511601165                                                                   GluPheValThrLysLeuLysGlnLeuGluMetLeuGluAsnGlnIle                               117011751180                                                                   SerThrIleHisGlnSerCysProSerGlnGluHisGlnGluIleLeu                               1185119011951200                                                               PheAsnAsnValArgTrpLeuSerIleGlnSerLysArgPheAlaPro                               120512101215                                                                   LeuTyrAlaLeuGluAlaLysArgIleGlnLysLeuGluHisThrIle                               122012251230                                                                   AsnAsnTyrIleGlnPheLysSerLysHisArgIleGluProValCys                               123512401245                                                                   LeuLeuValHisGlySerProGlyThrGlyLysSerValAlaThrAsn                               125012551260                                                                   LeuIleAlaArgAlaIleAlaGluLysGluAsnThrSerThrTyrSer                               1265127012751280                                                               LeuProProAspProSerHisPheAspGlyTyrLysGlnGlnGlyVal                               128512901295                                                                   ValIleMetAspAspLeuAsnGlnAsnProAspGlyAlaAspMetLys                               130013051310                                                                   LeuPheCysGlnMetValSerThrValGluPheIleProProMetAla                               131513201325                                                                   SerLeuGluGluLysGlyIleLeuPheThrSerAsnTyrValLeuAla                               133013351340                                                                   SerThrAsnSerSerArgIleThrProProThrValAlaHisSerAsp                               1345135013551360                                                               AlaLeuAlaArgArgPheAlaPheAspMetAspIleGlnValMetGly                               136513701375                                                                   GluTyrSerArgAspGlyLysLeuAsnMetAlaMetAlaThrGluThr                               138013851390                                                                   CysLysAspCysHisGlnProAlaAsnPheLysArgCysCysProLeu                               139514001405                                                                   ValCysGlyLysAlaIleGlnLeuMetAspLysSerSerArgValArg                               141014151420                                                                   TyrSerValAspGlnIleThrThrMetIleIleAsnGluArgAsnArg                               1425143014351440                                                               ArgSerAsnIleGlyAsnCysMetGluAlaLeuPheGlnGlyProLeu                               144514501455                                                                   GlnTyrLysAspLeuLysIleAspIleLysThrArgProProProGlu                               146014651470                                                                   CysIleAsnAspLeuLeuGlnAlaValAspSerGlnGluValArgAsp                               147514801485                                                                   TyrCysGluLysLysGlyTrpIleValAsnIleThrSerGlnValGln                               149014951500                                                                   ThrGluArgAsnIleAsnArgAlaMetThrIleLeuGlnAlaValThr                               1505151015151520                                                               ThrPheAlaAlaValAlaGlyValValTyrValMetTyrLysLeuPhe                               152515301535                                                                   AlaGlyHisGlnGlyAlaTyrThrGlyLeuProAsnLysArgProAsn                               154015451550                                                                   ValProThrIleArgAlaAlaLysValGlnGlyProGlyPheAspTyr                               155515601565                                                                   AlaValAlaMetAlaLysArgAsnIleValThrAlaThrThrSerLys                               157015751580                                                                   GlyGluPheThrMetLeuGlyValHisAspAsnValAlaIleLeuPro                               1585159015951600                                                               ThrHisAlaSerProGlyGluSerIleValIleAspGlyLysGluVal                               160516101615                                                                   GluIleLeuAspAlaLysAlaLeuGluAspGlnAlaGlyThrAsnLeu                               162016251630                                                                   GluIleThrIleIleThrLeuLysArgAsnGluLysPheArgAspIle                               163516401645                                                                   ArgGlnHisIleProThrGlnIleThrGluThrAsnAspGlyValLeu                               165016551660                                                                   IleValAsnThrSerLysTyrProAsnMetTyrValProValGlyAla                               1665167016751680                                                               ValThrGluGlnGlyTyrLeuAsnLeuGlyGlyArgGlnThrAlaArg                               168516901695                                                                   IleLeuMetTyrAsnPheProThrArgAlaGlyGlnCysGlyGlyVal                               170017051710                                                                   IleThrCysThrGlyLysValIleGlyMetHisValGlyGlyAsnGly                               171517201725                                                                   SerHisGlyPheAlaAlaAlaLeuLysArgSerTyrPheThrGlnSer                               173017351740                                                                   GlnGlyGluIleGlnTrpMetArgProSerLysGluAlaGlyTyrPro                               1745175017551760                                                               IleIleAsnAlaProThrLysThrLysLeuGluProSerAlaPheHis                               176517701775                                                                   TyrValPheGluGlyValLysGluProAlaValLeuThrLysAsnAsp                               178017851790                                                                   ProArgLeuLysThrAspPheGluGluAlaIlePheSerLysTyrVal                               179518001805                                                                   GlyAsnLysIleThrGluValAspGluTyrMetLysGluAlaValAsp                               181018151820                                                                   HisTyrAlaGlyGlnLeuMetSerLeuAspIleSerThrGluGlnMet                               1825183018351840                                                               CysLeuGluAspAlaMetTyrGlyThrAspGlyLeuGluAlaLeuAsp                               184518501855                                                                   LeuSerThrSerAlaGlyTyrProTyrValAlaMetGlyLysLysLys                               186018651870                                                                   ArgAspIleLeuAsnLysGlnThrArgAspThrLysGluMetGlnArg                               187518801885                                                                   LeuLeuAspAlaTyrGlyIleAsnLeuProLeuValThrTyrValLys                               189018951900                                                                   AspGluLeuArgSerLysThrLysValGluGlnGlyLysSerArgLeu                               1905191019151920                                                               IleGluAlaSerSerLeuAsnAspSerValAlaMetArgMetAlaPhe                               192519301935                                                                   GlyAsnLeuTyrAlaAlaPheHisArgAsnProGlyValValThrGly                               194019451950                                                                   SerAlaValGlyCysAspProAspLeuPheTrpSerLysIleProVal                               195519601965                                                                   LeuMetGluGluLysLeuPheAlaPheAspTyrThrGlyTyrAspAla                               197019751980                                                                   SerLeuSerProAlaTrpPheGluAlaLeuLysMetValLeuGluLys                               1985199019952000                                                               IleGlyPheGlyAspArgValAspTyrIleAspTyrLeuAsnHisSer                               200520102015                                                                   HisHisLeuTyrLysAsnLysIleTyrCysValLysGlyGlyMetPro                               202020252030                                                                   SerGlyCysSerGlyThrSerIlePheAsnSerMetIleAsnAsnLeu                               203520402045                                                                   IleIleArgThrLeuLeuLeuLysThrTyrLysGlyIleAspLeuAsp                               205020552060                                                                   HisLeuLysMetIleAlaTyrGlyAspAspValIleAlaSerTyrPro                               2065207020752080                                                               HisGluValAspAlaSerLeuLeuAlaGlnSerGlyLysAspTyrGly                               208520902095                                                                   LeuThrMetThrProAlaAspLysSerAlaThrPheGluThrValThr                               210021052110                                                                   TrpGluAsnValThrPheLeuLysArgPhePheArgAlaAspGluLys                               211521202125                                                                   TyrProPheLeuIleHisProValMetProMetLysGluIleHisGlu                               213021352140                                                                   SerIleArgTrpThrLysAspProArgAsnThrGlnAspHisValArg                               2145215021552160                                                               SerLeuCysLeuLeuAlaTrpHisAsnGlyGluGluGluTyrAsnLys                               216521702175                                                                   PheLeuAlaLysIleArgSerValProIleGlyArgAlaLeuLeuLeu                               218021852190                                                                   ProGluTyrSerThrLeuTyrArgArgTrpLeuAspSerPhe                                     219522002205                                                                   (2) INFORMATION FOR SEQ ID NO:3:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 30 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:                                        CGCCTCAGTAAATTTTTTCAACCAACTATC30                                               (2) INFORMATION FOR SEQ ID NO:4:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 17 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:                                        GGCGTATCTGACAAGGG17                                                            (2) INFORMATION FOR SEQ ID NO:5:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 45 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:                                        CTGCAGTAATACGACTCACTATAGGTTAAAACAGCTCTGGGGTTG45                                (2) INFORMATION FOR SEQ ID NO:6:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 19 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:                                        GAATCATGGTGTCTATCTC19                                                          (2) INFORMATION FOR SEQ ID NO:7:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 17 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:                                        CTCGTTTGTGGCATAAC17                                                            (2) INFORMATION FOR SEQ ID NO:8:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 17 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:                                        CCCATGGGGATGCACCG17                                                            (2) INFORMATION FOR SEQ ID NO:9:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 46 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:                                        CTGCAGTAATACGACTCACTATAGGTTAAAACAGCTCTGGGGTTTG46                               __________________________________________________________________________ 

What is claimed is:
 1. A full-length and infectious recombinant DNA molecule comprising a true RNA virus cDNA derived from a vaccine strain 3 poliovirus, said cDNA being selected from the group consisting of:(a) pLED3.2; and (b) cDNAs which code for the polypeptides encoded by pLED3.2.
 2. A full-length and infectious recombinant DNA molecule comprising an RNA virus cDNA derived from a vaccine strain 3 poliovirus, said cDNA being selected from the group consisting of:(a) pLED3; and (b) cDNAs which code for the polypeptides encoded by pLED3.
 3. The recombinant DNA molecule according to any of claims 1, or 2, wherein said RNA virus cDNA is operatively linked to a promoter of RNA transcription.
 4. The recombinant DNA molecule according to claim 3, wherein said promoter is the T7 promoter.
 5. The host according to claim 4, wherein said host is E. coli.
 6. A host transformed with a recombinant DNA molecule according to any one of claims 1, or 2, said host being selected from the group consisting of bacteria, yeast and other fungi, insect cells and animal cells.
 7. The host according to claim 6, wherein said host is an animal cell selected from the group consisting of Vero cells, HeLa cells, COS cells, CV-1 cells and primary monkey kidney cells.
 8. A host transformed with a recombinant DNA molecule according to claim 3 said host being selected from the group consisting of bacteria, yeast and other fungi, insect cells and animal cells.
 9. The host according to claim 8, wherein said host is an animal cell selected from the group consisting of Vero cells, HeLa cells, COS cells, CV-1 cells and primary monkey kidney cells. 